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1 

V'like Domain Binding Molecules 

Field of the Invention 

The present invention relates to V-like Domain binding molecules with 
5 affinities for target molecules. The present invention also relates to 
compositions comprising these V-like domain binding molecules and to 
methods of diagnosis or treatment which involve the use of these molecules. 
The present invention also relates to a method for selecting V-like Domain 
binding molecules with novel binding affinities and/or specificities. 

10 

Background of the Invention 

Immunoglobulin Superfamilv - Antibody Variable fV] Domains 

Antibodies are the paradigm of specific high-affinity binding reagents 
and provide an antigen binding site by interaction of variable heavy (V H ) and 
15 variable light [VJ immunoglobulin domains. The binding interface is formed 
by six surface polypeptide loops, termed complementarity determining 
regions (CDKs), three from each variable domain, which are highly variable 
and combined provide a sufficiently large surface area for interaction with 
anti*=EEL Specific binding reagents can be formed by association of only the 
V H and V L domains into an Fv module. Bacterial expression is enhanced by 
j n mmg the V-domains with a linker polypeptide into a single-chain scFv 
molecule. Tinman is ation" of recombinant antibodies by grafting murine 
CDR loop structures onto a human Fv framework is disclosed by Winter et al 
EP-239400. 

Methods to improve the expression and folding characteristics of 
single-chain Fv molecules were described by Nieba et al (1997]. The 
properties of single V-domains, derived from natural mammalian antibodies, 
have been described by Gussow et al in WO/90/05144 and EP 0368684B1 and 
by Davis et al in WO/91/08482. Single camelid V-domains have been 
described by Hamers et al in WO/96/34103 and in WO/94/25591. A method 
for reducing the hydrophobicity of the surface of a human V H domain by 
replacing human amino acid sequences with camelid amino acid sequences 
was described by Davies and Riechmann (1994). Methods to exchange other 
regions of human V H sequences with camel sequences to further enhance 
protein stability, including the insertion of cysteine residues in CDR loops, 
were described by Davies and Riechmann (1996). 
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Several attempts to engineer high-affinity single domain binding 
reagents using either the V H or V L domains alone have been unsuccessful, 
due to lack of binding specificity and the inherent insolubility of single 
domains in the absence of the hydrophobic face where the V H and V L 
5 domains interact (Kortt et al, 1995). 
T-cell Receptor Variable fVl Domains 

The T-cell receptor has two V-domains that combine into a structure 
similar to the Fv module of an antibody that results from combination of the 
VH and VL domains. Novotny et al (1991) described how the two V-domains 

10 of the T-cell receptor (termed alpha and beta) can be fused and expressed as 
a single chain polypeptide and, further, how to alter surface residues to 
reduce the hydrophobicity directly analogous to an antibody scFv. Other 
publications describe the expression characteristics of single-chain T-cell 
receptors comprising two V-alpha and V-beta domains (Wulfing and 

15 Pluckthun,1994; Ward, 1991). 

Non-antibodv ligands - CTLA-4 and CD26 V-like Domains 

There are a class of non-antibody ligands which bind to specific 
binding partners which also comprise V-like domains. These V-like domains 
are distinguished from those of antibodies or T-cell receptors because they 

20 have no propensity to join together into Fv-type molecules. These non- 
antibody ligands provide an alternative framework for the development of 
novel binding moieties with high affinities for target molecules. Single 
domain V-like binding molecules derived from these non-antibody ligands 
which are soluble are therefore desirable. Examples of suitable non-antibody 

25 ligands are CTLA-4, CD28 and ICOS (Hutloff et al, 1999). 

Cytotoxic T-lymphocyte associated antigen 4 (CTLA-4) and the 
homologous cell-surface proteins CD28 and ICOS, are involved in T-cell 
regulation during the immune response. CTLA-4 is a 44 kDa homodimer 
expressed primarily and transiently on the surface of activated T-cells, where 

30 it interacts with CD80 and CD86 surface antigens on antigen presenting cells 
to effect regulation of the immune response (Waterhouse et al.1996, van der 
Merwe et al. 1997). CD28 is a 44kDa homodimer expressed predominantly 
on T-cells and, like CTLA-4, interacts with CD80 and CD86 surface antigens 
on antigen presenting cells to effect regulation of the immune response 

35 (Linsley et all 990]. Current theory suggests that competition between CTLA- 
4 and CD28 for available ligands controls the level of immune response, for 
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example, gene deletion of CTLA-4 in knock-out mice results in a massive 
over-proliferation of activated T-cells (Waterhouse et al. 1995]. 

Each CTLA-4 monomelic subunit consists of an N-terminal 
extracellular domain, transmembrane region and C-terminal intracellular 
5 domain. The extracellular domain comprises an N-terminal V-like domain 
(VLD; of approximately 14 kDa predicted molecular weight by homology to 
the immunoglobulin superfamily) and a stalk of about 10 residues connecting 
the VLD to the transmembrane region. The VLD comprises surface loops 
corresponding to CDR-1, CDR-2 and CDR-3 of an antibody V-domain (Metzler 

10 1997). Recent structural and mutational studies on CTLA-4 suggest that 

binding to CDBO and CD86 occurs via the VLD surface formed from A'GFCC* 
V-like beta-strands and also from the highly conserved MYpppy sequence in 
the CDR3-like surface loop (Peach et al. 1994; Morton et al. 1996; Metzler et 
al. 1997). Dimerisauon between CTLA-4 monomers occurs through a 

15 disulphide bond between cysteine residues (Cys 120 ) in the two stalks, which 
results in tethering of the two extracellular domains, but without any 
apparent direct association between V-like domains (Metzler et al. 1997). 
Dimerisation appears to contribute exclusively to increased avidity for the 
ligands. 

20 In vitro Expression of Soluble Forms of CTLA-4. 

Neither the extracellular domains nor V-like domains (VLDs) of human 
CTLA-4 molecule have been successfully expressed as soluble monomers in 
bacterial cells, presumably due to aggregation of the expressed proteins 
(Linsley et al, 1995). Expression of the extracellular N-terminal domain (Met 1 
to Asp 1 ", comprising Cys 120 ) in B.coli results in production of a dimeric 28 
kDa MW protein, in which two CTLA-4 V-like domains are joined by a 
disulphide linkage at Cys 120 . Truncation at Val 1 " removes these cysteines 
and was intended to enable expression of a 14 kDa VLD in soluble, 
monomeric form. However, the product aggregated and it was concluded that 
hydrophobic sites, which were normally masked by glycosylation, were now 
exposed and caused aggregation (Linsley et al, 1995). 

There have been some reports of successful expression of monomeric, 
glycosylated CTLA-4 extracellular domains in eukaryotic expression systems 
(ie CHO cells and the yeast Pichia pastoris; Linsley et al. 1995; Metzler et al. 
1997; Gerstmayer et al. 1997). Glycosylation in these eukaryotic expression 
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systems is presumed to occur at the two N-linked glycosylation sites in the 
VLD (Asn76 and Asnl08 ). However, high yields have only been described 
for expression of a gene encoding a CTLA-4 VLD fused to Ig-CH2/CH3 
domains which produces a dimeric recombinant protein with 2 CTLA-4 VLDs 
5 attached to an Fc subunit (WO 95/01994 and AU 16458/95). AU 60590/96 ■ 
describes mutated forms of CTLA-4 VLDs with single amino acid 
replacements of the first tyrosine (Y) in the MYPPPY surface loop which 
retain and modifies the affinity for the natural CD80 and CD86 ligands. AU 
60590/96 describes the preferred soluble form of CTLA-4 VLDs as a 
10 recombinant CTLA-4/Ig fusion protein expressed in eukaryotic cells and does 
not solve the aggregation problem in prokaryote expression systems. EP 
O757099A2 describes the use of CTLA-4 mutant molecules, for example the 
effect of changes on ligand binding of mutations in the CDR3-like loop. 

15 Summary of the Invention 

The present inventors have now developed novel binding molecules 
derived from the V-like domains (VLDs) of non-antibody ligands such as 
CTLA-4, CD28 and ICOS. Replacement of CDR loop structures within the 
VLDs results unexpectedly in the production of monomelic, correctly folded 

20 molecules with altered binding specificities and improved solubility. 

Accordingly, in a first aspect the present invention provides a binding 
moiety comprising at least one monomeric V-like domain (VLD) derived from 
a non-antibody ligand, the at least one monomeric V-like domain being 
characterised in that at least one CDR loop structure or part thereof is 

25 modified or replaced such that the solubility of the modified VLD is 
improved when compared with the unmodified VLD. 

Within the context of the present invention, the modification or 
replacement may involve any change to one or more physical characteristics 
(such as size, shape, charge, hydrophobicity etc) of the at least one CDR loop 

30 structure. The modification or replacement may result in a reduction in the 
size of the at least one CDR loop structure. In a preferred embodiment, 
however, at least one CDR loop structure or part thereof is modified or 
replaced such that 

(i) the size of the CDR loop structure is increased when compared 

35 with corresponding CDR loop structure in the unmodified VLD; and/or 



WO 99/451 10 



PCT/AU99/00136 



5 

(ii) the modification or replacement results in the formation of a 
disulphide bond within or between one or more of the CDR loop structures. 

In a second aspect, the present invention provides a binding moiety 
comprising at least one monomelic V-like domain (VLD) derived from a non- 
5 antibody ligand, the at least one monomelic V-like domain being 
characterised in that at least one CDR loop structure or part thereof is 
modified or replaced such that 

(i) the size of the CDR loop structure is altered when compared with 
corresponding CDR loop structure in the unmodified VLD; and/or 
10 (ii) the modification or replacement results in the formation of a 

disulphide bond within or between one or more of the CDR loop structures. 

In a preferred embodiment of the second aspect, the size of the CDR 
loop structure is increased by at least two, more preferably at least three, 
more preferably at least six and more preferably at least nine amino acid 
15 residues. 

In a further preferred embodiment, the modified binding moiety of the 
first or second aspect of the present invention also exhibits an altered 
binding affinity or specificity when compared with the unmodified binding 
moiety. Preferably, the effect of replacing or modifying the CDR loop 
20 structure is to reduce or abolish the affinity of the VLD to one or more 

, t natural ligands of the unmodified VLD. Preferably, the effect of replacing or 
modifying the CDR loop structure is also to change the binding specificity of 
the VLD . Thus it is preferred that the modified VLD binds to a specific 
binding partner which is different to that of the unmodified VLD. 

The phrase "V-like domain" or "VLD M is intended to refer to a domain 
which has similar structural features to the variable heavy (Vh) or variable 
light (VJ antibody. These similar structural features include CDR loop 
structures. By "CDR loop structures" we mean surface polypeptide loop 
structures or regions like the complementarity determining regions in 
antibody V-domains, 

The phrase "non-antibody ligand" is intended to refer to any ligand 
which binds to a specific binding partner and which is not an antibody or a 
T<ell receptor. Examples of suitable non-antibody ligands are T-cell surface 
proteins such as CTLA-4, CD28 and ICOS. It will be appreciated by those 
skilled in the art that other non-antibody ligands which may provide V-like 
domains suitable for the invention are other T-cell surface proteins such as 
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CD2, CD4, CD7 and CD16; B cell surface proteins such as CD 19, CD79a, 
CD22, CD33, CD80 and CD86; adhesion molecules such as CD48, CD541CAM 
and CD5 8. These molecules, which are listed in Table 1, provide a non- 
exhaustive list of structures which may form the basis for the single domain 
5 binding molecules of the present invention. 

The phrase "V-like domain derived from a non-antibody ligand" is 
intended to encompass chimeric V-like domains which comprise at least part 
of a V-like domain derived from a non-antibody ligand. 

0 

Table i: Non-antibody Ligands 



Molecule 


Size 


Structure 


Tcell Surface Pro loins 






CD2 


45-58 kDa 


VC 1 domains 


CD4 


5SkDa 


V2C2 


CD7 


40kDa 


V domain 


CD 16 


50-05 kDa 


2x C domains 


B cell Surface Proteins 






CD19 


35kDa 


Zx C domains 


CD790 


33kDa 




CD22 


130-140kDa 


lxV BxC domains 


CD33 


S7kDa 


VC domain 


CD80 


60kDa 


VC domain 


CD88 


GOkDa 


VC domain 


Adhesion molecules 






CD4R 


45 kDa 


VC domain 


CDS 4 1 CAM 


BS-llOkDa 




CD58 


55-70kDa 


VC domain 



1 V a variable Ig domain, C - consUnt domain 

15 

These molecules are discussed in (1J The Leucocyte Antigen Facts Book, 1993, Eds Barclay 
ol al., Academic Press, London; and (2) CD Antigens 1996 (1997) Immunology Today 18. 
100-101, the entire coutonts of which are incorporated herein by reference. 
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The "solubility" of modified binding moieties of the present invention 
correlates with the production of correctly folded, monomeric domains. The 
solubility of the modified VLDs may therefore be assesed by HPLC. For 
example, soluble (monomeric) VLDs will give rise to a single peak on the 
5 HPLC chromatography whereas insoluble (eg. multimeric and aggregated) 
VLDs will give rise to a plurality of peaks. A person skilled in the art will 
therefore be able to detect an increase in solubility of modified VLDs using 
routine HPLC techniques. 

It will be appreciated that the binding moieties of the present 
invention may be coupled together, either chemically or genetically, to form 
multivalent or multifunctional reagents. For example, the addition of C- 
terminal tails, such as in the native CTLA-4 with Cys lz0 , will result in a 
dimer. 

The binding moieties of the present invention may also be coupled to 
other molecules for various diagnostic formulations. For example, the VLDs 
may comprise a C- terminal polypeptide tail or may be coupled to streptavidin 
or bio tin for multi-site in vitro assays. The VLDs may also be coupled to 
radioisotopes, dye markers or other imaging reagents for in vivo detection 
and/or localisation of cancers, blood clots, etc. The VLDs may also be 
immobilised by coupling onto insoluble devices and platforms for diagnostic 
and biosensor applications. 

In a most preferred embodiment of the first aspect of the present 
invention, the V-like domain is derived from the extracellular domain of the 
CTLA-4 molecule or the CD28 molecule. In a further preferred embodiment 
one or more surface loops of the CTLA-4 V-like domain and preferably the 
CDR-1, CDR-2 or CDR-3 loop structures are replaced with a polypeptide 
which has a binding affinity for a target molecule of interest. Target 
molecules of interest comprise, but are not limited to, drugs, steroids, 
pesticides, antigens, growth factors, tumour markers, cell surface proteins or 
viral coat proteins. It will be appreciated that these VLDs may be 
polyspecific, having affinities directed by both their natural surfaces and 
modified polypeptide loops. 

In a further preferred embodiment the effect of replacing or modifying 
the CTLA-4, CD26 and 1COS V-like domain surface loops is to abolish the 
natural affinity to CD80 and CD8B. 
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In one preferred embodiment, one or more of the CDR loop structures 
of the VLD ere replaced with one or more CDR loop structures derived from 
an antibody. The antibody may be derived from any species. In a preferred 
embodiment, the antibody is derived from a human, rat, mouse, camel, llama 
5 or shark. The antibody or antibodies may be selected from the camel 
antibody cAB-Lys3 and the human anti-melanoma antibody V86. 

In a further preferred embodiment, one or more of the CDR loop 
structures are replaced with a binding determinant derived from a non- 
antibody polypeptide. For example, one or more of the CDR loop structures 
10 may be replaced with a polypeptide hormone, such as somatostatin which is 
a 14 residue intra-disulphide bonded polypeptide important in cancer cell 
recognition, or with a viral protein such as the human influenza vims 
haemagglutinin protein. 

In a further preferred embodiment the V-like domain of the binding 
15 moiety comprises CDR loop structures homologous in character to CDR loop 
structures found in camelid or llama antibodies. For example, the CDR loop 
structures may contain one or more non-conventional substitutions (eg. 
hydrophobic to polar in nature). In another preferred embodiment, the CDR- 
1 and CDR-3 loop structures may adopt non-canonical conformations which 
20 are extremely heterologous in length. The V-like domain may also possess a 
disulphide linkage interconnecting the CDR-1 and CDR-3 loop structures (as 
found in some camel V„H antibodies) or the CDR-2 and CDR-3 loop 
structures (as found in some llama V H H antibodies). 

In a third aspect the present invention provides a polynucleotide 
25 encoding a binding moiety of the first or second aspect of the present 
invention. The polynucleotide may be incorporated into a plasmid or 
expression vector. 

In a fourth aspect the present invention provides a prokaryotic or 
eukaryotic host cell transformed with a polynucleotide according to the third 
30 aspect of the present invention. 

In a fifth aspect the present invention provides a method of producing 
a binding moiety which comprises culturing a host cell according to the 
fourth aspect of the present invention under conditions enabling expression 
of the binding moiety and optionally recovering the binding moiety. 
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In a preferred embodiment of the present invention the binding moiety 
is produced by expression in a bacterial host Preferably, the binding moiety 
is unglycosylated. 

In a sixth aspect the present invention provides a pharmaceutical 

5 composition comprising a binding moiety of the first or second aspect of the 
present invention and a pharmaceutically acceptable carrier or diluent 

In a seventh aspect the present invention provides a method of treating 
a pathological condition in a subject, which method comprises administering 
to the subject a binding moiety according to the first or second aspect of the 

0 present invention. 

For in vivo applications it is preferable that VLDs are homologous to 
the subject of treatment or diagnosis and that any possible xenoantigens are 
removed. Accordingly it is preferred that VLD molecules for use in clinical 
applications are substantially homologous to naturally occurring human 

5 immunoglobulin superfamily members. 

In an eighth aspect the present invention provides a method of 
selecting a binding moiety with an affinity for a target molecule which 
comprises screening a library of polynucleotides for expression of a binding 
moiety with an affinity for the target molecule, the polynucleotides encoding 

0 VLDs derived from one or more non-antibody ligands, wherein the 
polynucleotides have been subjected to mutagenesis which results in a 
modification or replacement in at least one CDR loop structure in at least one 
VLD and wherein the solubility of the isolated modified VLD is improved 
when compared with the isolated unmodified VLD. 

5 It will be appreciated by those skilled in the art that within the context 

of the eighth aspect of the present invention, any method of random or 
targetted mutagenesis maybe used to introduce modifications into the V-like 
domains. In a preferred embodiment, the mutagenesis is targetted 
mutagenesis. Preferably, the targetted mutagenesis involves replacement of 

3 at least one sequence within at least one CDR loop structure using splice 
overlap PCR technology. 

It will also be appreciated by those skilled in the art that the 
polynucleotide library may contain sequences which encode VLDs 
comprising CDR loop structures which are substantially identical to CDR 

3 loop structures found in naturally occurring immunoglobulins as well as 
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sequences which encode VLDs comprising non-naturally occurring CDR loop 
structures. 

In a preferred embodiment of the eighth aspect of the present 
invention, the screening process involves displaying the modified V-iike 
5 domains as gene in protein fusions on the surface of bacteriophage particles. 
The library may comprise bacteriophage vectors such as pHFA, fd-tet-dog or 
pFAB.5c containing the polynucleotides encoding the V-like domains. 

In a further preferred embodiment of the eighth aspect, the screening 
process involves displaying the modified V-like domains in a ribosomal 
10 display selection system. 

Throughout this specification, the word "comprise' 1 , or variations such 
as "comprises" or "comprising", will be understood to imply the inclusion of a 
stated element, integer or step, or group of elements, integers or steps, but 
not the exclusion of any other element, integer or step, or group of elements, 
15 integers or steps. 

Brief Description of the Drawings 

Figure 1 : CTLA-4 VLD-Specific Oligonucletides. 

20 

Figure 2: Polynucleotide sequence of complete cDNA encoding Lm i i^ 
CTLA-4 and polypeptide sequence of the VLD of human CTLA-4. 

Figure 3 : Display of CTLA-4 VLD STMs as gene 3 fusions on the surface of 
25 phage or phagemid. CTLA-4 VLD STMs are depicted as black spheroids; 
gene 3 protein is depicted as white spheroids; FLAG polypeptide is depicted 
in grey; genes are marked in a similar colour code and are depicted in an 
oval phage (mid) vector. 

30 Figure 4: Schematic representation of the somatostatin polypeptide. 
Somatostatin (somatotropin release-inhibiting factor SRIF) is a cyclic 14- 
amino acid polypeptide. The cyclic nature is provided by a disulphide 
linkage between the cysteine residues at positions 3 and 14. The four 
residues which constitute the tip of the loop (Phe-Trp-Lys-Thr) are 

35 implicated in binding to members of the somatostatin receptor family. 
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Figure 5: Size exclusion HPLC profiles of affinity purified CTLA-4 VLD and 
CTLA-4-Som3 STM. Recombinant human CTLA-4 proteins were expressed 
in E. coli host TGl from vector pGC, purified from peripiasmic extracts by 
anti-FLAG affinity chromatography and subjected to size exclusion 
5 chromatography on a calibrated Superose 12 HR column. The elution 

profiles of purified CTLA-4 VLD and CTLA-4-Som3 STM are overlayed in this 
graph. CTLA-4 VLD comprises tetramer (21.86 rain), dimer (26.83) and 
monomer (29.35 min). CTLA-4-Som3 STM comprises dimer (26.34) and 
monomer (29.28). Traces represent absorbance at 214 nm and are given in 
10 arbitrary units. 

Figure 6: Schematic diagram of CTLA-4 VLD loop replacements. Construct 
A (CTLA-4 VLD: S2) represents the wild-type CTLA-4 extracellular V- 
doinain, spanning residues 1-115. Constructs B (CTLA-4-Soml; PP2) and C 

15 (CTLA-4-Soml-Cysl20; PP5) both contain the 14 residue somatostain 
polypeptide in CDRl. PP5 also carries a C-terniinal extension containing 
Cysl20. Construct D (CTLA-4-Som3; PP8) contains the 14 residue 
somatostatin polypeptide in place of CDR3. In construct E (CTLA-4-HA2: 
XX4), CDR2 has been replaced with a haemagglutinin tag. In construct F 

20 (CTLA-4-Soml-Som3: W3j, both CDRl and CDR3 have been replaced with 
the somatostain polypeptide. In construct G (CTLA-4-Som-HA2-Som3: ZZ3) 
CDRl and CDR3 are replaced with the somatostatin polypeptide whilst CDR2 
is replaced with a haemagglutinin tag. In construct H (CTLA-4-anti-lys:2V8), 
all three CDR loop structures have been replaced with the CDR loops from a 

25 camel anti-lysozyme V H H molecule. Construct I (CTLA-4-anti-mel: 3E4) 
represents CTLA-4 VLD in which all three CDRs have been replaced by the 
VH CDR loops from anti-melanoma antibody V86 (Cai And Garen, 1997). 
PelB, cleavable pectate lyase secretion sequence (22 aa); flag, dual flag tag 
( AAADYKDDDDKAADYKDDDDK) . 

30 

Figure 7: HPLC profiles of purified recombinant human CTLA-4 STMs. 
Recombinant CTLA-4 VLDs were expressed in E. coli host TGl from vector 
pGC, purified from peripiasmic extracts by anti-FLAG affinity 
chromatography and subjected to size exclusion chromatography on a 
35 calibrated Superose 12 HR column. The elution profiles of the purified 

proteins are shown. Panel A, CTLA-4 DIMER (PP5); Panel B, CTLA-4R (S2); 
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Panel C t CTLA-4-HA2 (XX4); Panel D, CTLA-4-Som3 [PP8J; Panel E. CTLA- 
4-Soinl (PP2); Panel F, CTXA-4-Soml-Som3 (W3); Panel G, CTLA-4-Som- 
HA2-Som3 (ZZ3); Panel H, CTLA-4-anti-lys (2V8); Panel I, CIXA-4-anti-mel 
(3E4). ). Traces represent absorbance at 214 nm and are given in arbitrary 
5 units. 

Figure 8: Comparison by size exclusion FPLC analysis of affinity purified 
CTLA-4 constructs synthesised using bacterial expression vector pGC or 
pPOW. Recombinant human CTLA-4R or its loop variants were expressed in 

ID E. coli host TOP10F, purified from periplasms extracts by anti-FLAG affinity 
chromatography and subjected to size exclusion chromatography on a 
calibrated Superose 12 HR column. The elution profiles of proteins 
expressed from vector pGC are shown on the left, whilst proteins expressed 
from vector pPOW are shown on the right Panel A, wild -type CTLA-4 VXD 

15 (S2); B, CTLA-4-Soml[PP2); C, CTLA-4-Som3(PP8); D, CTLA-4-Anti- 
lys(2V8); E f CTLA-4-Soml-HA2-Som3(ZZ3). 

Figure 9: Protein stability analysis. Stability of monomer preparations of 
CTLA-4 VLD and loop variant constructs was analysed by size exclusion fpk 
20 chromatography on a precalibrated superose 12 hr (Pharmacia) column 
following several cycles of freeze/thawing. Aliqnots of **rh protein were 
tested immediately after peak purification and following two cycles of 
freeze/thawing. A, CTLA-4 VXD (S2); B, CTLA-4-Soml (PP2); C, CTLA-4- 
Som3 (PP8); D, CTLA-4-anti-lys (2V8); E, CTLA-4-Son>HA2-Som3 (ZZ3). 

25 

Figure 10: Lysozyme binding characteristics of CTLA-4-anti-Iys construct 
2V8. 

Figure 11: Screening of CTLA-4 VLD phagemid library on immobilised Sh 
30 bleomycin. 

Figure 12: Screening of CTLA-4 VLD libraries in solution 

Detailed Description of the Invention 

35 The present invention relates to the design of novel soluble VLD 

binding molecules derived from the V-like domain of immunoglobulin 



WO 99/45110 



PCT/AU99/00136 



13 

superfamily members, such as the human CTLA-4 molecule. The preferred 
binding molecules of the present invention provide the following advantages 
(i) use of a native human protein obviates the need for subsequent 
humanisation of the recombinant molecule, a step often required to protect 
5 against immune system response if used in human treatment; (ii) the domain 
is naturally monomelic as described above (incorporation of residue CyslZO 
in a C-terminal tail results in production of a dimeric molecule); and (iii) 
structural modifications have resulted in improved E.coli expression levels. 

Prior to publication of the first CTLA-4 structure determination, 
available sequence data and mutational analyses of both this molecule and 
CD28 were analysed. This allowed modelling and prediction of the regions 
corresponding to antibody CDRl, 2 and 3 regions. It was hypothesised that 
such areas would be susceptible to mutation or substitution without 
substantial effect upon the molecular framework and hence would allow 
expression of a correctly folded molecule. The subsequently published 
structure (Metzler et al. 1997) showed these predictions to be accurate, 
despite the unexpected separation of CDRl from the ligand-binding site, and 
the extensive bending of CDR3 to form a planar surface contiguous with the 
ligand binding face. 

In an initial set of experiments the V-like domain of the human CTLA- 
4 molecule was modified by replacement of CDR loop structures with either 
of two defined polypeptides. The two polypeptides were human 
somatostatin (Som) and a portion of the human influenza virus 
haemagglutinin protein (HA -tag). Somatostatin (SRIF: somatotropin release- 
inhibiting factor) is a 14 residue polypeptide comprising a disulphide bond 
that forces the central 10 residues into a loop. Human somatostatin is 
biologically widespread within the body and mediates a number of diverse 
physiological functions such as regulation of growth hormone secretion etc 
(Reisne, 1995). Human somatostatin binds a number of specific receptors 
(there are at least five subtypes) which have differing tissue specificities and 
affinities (Schonbrunn et al. 1995). These aspects of binding and activation 
are as yet poorly understood, but one salient feature is the high density of 
somatostatin receptors present on a number of cancerous cell lines, for 
example cancers of the neuro-endocrine system and small lung cancers 
(Reubi 1997). Artificial analogues of somatostatin have been produced for 
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imaging of such tumours which are resistant to degradation compared with 
the highly labile somatostatin polypeptide. 

The haemagglutinin epitope sequence consists of the 9 residues 
YPYDVPDYA. A commercially produced antibody is available which 
5 specifically recognises this sequence. The epitope tag can be detected when 
randomly or directionally incorporated within the structure of proteins 
(Canfield et al. 1996). 

Replacement of one or more CDR loop structures in the CTLA-4 V-like 
domain with somatostatin or the HA-tag resulted in the production of 

10 soluble, monomeric, unglycosylated binding molecules using different 
bacterial expression systems. This surprising finding shows that V-like 
domains provide a basic framework for constructing soluble, single domain 
molecules, where the binding specificity of the molecule may be engineered 
by modification of the CDR loop structures. 

15 The basic framework residues of the V-like domain may be modified in 

accordance with structural features present in camelid antibodies. The camel 
heavy chain immunoglobulins differ from "conventional" antibody structures 
by consisting of only a single VH domain (Hamers-Castennan et al 1993]. 
Several unique features allow these antibodies to overcome the drsl 

20 problems of solubility and inability to present a sufficiently large antigen 
binding surface. 

First several non-conventional substitutions [predominanliy 
hydrophobic to polar in nature) at exposed framework residues reduce the 
hydrophobic surface, while maintaining the internal beta-sheet framework 

25 structure (Desmyter et al. 1996). Further, within the three CDR loops several 
structural features compensate for the loss of antigen binding-surface usually 
provided by the VL domain. While the CDR2 loop does not differ extensively 
from other VH domains, the CDR-1 and -3 loops adopt non-canonical 
conformations which are extremely heterologous in length. For example, the 

30 Hi loop may contain anywhere between 2-8 residues compared to the usual 
hve in Ig molecules. However, it is the CDR3 loop which exhibits greatest 
variation: in 17 camel antibody sequences reported, the length of this region 
varies between 7 and 21 residues (Muyldermans et al. 1994). Thirdly, many 
camelid VH domains possess a disulphide linkage interconnecting CDRs -1 

35 and -3 in the case of camels and interconnecting CDRs -1 and -2 in the case 
of llamas (Vu et al. 1997). The function of this structural feature appears to 
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be maintenance of loop stability and providing a more contoured, as distinct 
from planar, loop conformation which both allows binding to pockets within 
the antigen and gives an increased surface area. However, not all camelid 
antibodies possess this disulphide bond suggesting that it is not an absolute 

5 structural requirement 

These foregoing features have enabled camelid V-domains to present 
as soluble molecules in vivo and with sufficiently high affinity to form an 
effective immune response against a wide variety of target antigens. In 
contrast, cell surface receptors of the Ig superfaxnily [such as CD4 and CD2) 

0 comprise V-like binding domains and appear to bind cognate receptors with 
surface features other than the CDR loops. These V-like domains bind to 
cognate receptors with very low affinity. Physiological signalling between 
two cells are mediated by the avidity of multi-point binding, when two cell 
surfaces connect and each contains multiple receptors. CD2 is a well- 

5 characterised example: binding to CD58 is mediated by a highly constrained 
set of minor electrostatic interactions generated by charged and polar 
residues located in the GFCC'C" face (not the antibody type CDR-1, CDR-2 or 
CDR- 3 loops). This results in a low affinity but highly specific interaction 
(Bodian et al 1994). 

0 The present invention also relates to a method for generating and 

selecting single VLD molecules with novel binding affinities far target 
molecules. This method involves the application of well known molecular 
evolution techniques to V-like domains derived from members of the 
immunoglobulin superfamily. The method may involve the production of 

5 phage or ribosomal display libraries for screening large numbers of mutated 
V-like domains. 

Filamentous fd-bacteriophage genomes are engineered such that the 
phage display, on their surface, proteins such as the Ig-like proteins (scFv, 
Fabs) which are encoded by the DNA that is contained within the phage 

0 (Smith, 1985; Huse et al., 1989; McCafferty et al., 1990; Hoogenboom et al., 
1991). Protein molecules can be displayed on the surface of Fd 
bacteriophage, covalently coupled to phage coat proteins encoded by gene III, 
or less commonly gene VIII. Insertion of antibody genes into the gene HI coat 
protein gives expression of 3-5 recombinant protein molecules per phage, 

5 situated at the ends. In contrast, insertion of antibody genes into gene VIII 
has the potential to display about 2000 copies of the recombinant protein per 
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phage particle, however this is a multivalent system which could mask the 
affinity of a single displayed protein. Fd phagemid vectors are also used, 
since they can be easily switched from the display of functional Ig-Iike 
fragments on the surface of Fd -bacteriophage to secreting soluble Ig-like 
5 fragments in E. coli. Phage-dis played recombinant protein fusions with the 
N-terminus of the gene in coat protein are made possible by an amber codon 
strategically positioned between the two protein genes. In amber suppressor 
strains of E. coli, the resulting Ig domain-gene HI fusions become anchored in 
the phage coat 

10 A selection process based on protein affinity can be applied to any 

high-affinity binding reagents such as antibodies, antigens, receptors and 
ligands (see, for example, Winter and Milstein, 1991, the entire contents of 
which are incorporated herein by reference). Thus the selection of the 
highest affinity binding protein displayed on bacteriophage is coupled to the 

15 recovery of the gene encoding that protein. Ig-displaying phage can be 

affinity selected by binding to cognate binding partners covalently coupled to 
beads or adsorbed to plastic surfaces in a manner similar to EUSA or solid 
phase radioimmunoassays. While almost any plastic surface will adsorb 
protein antigens, some commercial prod acts Ere especially formulated for 

20 this purpose, such as Nunc Immunotubes. 

Ribosomal display libraries invcbe polypeptides synthesised de novo 
in cell-free t ransla tion systems and displayed on the surface of ribosomes for 
selection purposes (Hanes and Pluckthun, 1997; He and Taussig, 1997). The 
"cell-free translation system" comprises ribosomes. soluble enzymes required 

25 for protein synthesis (usually from the same cell as the ribosomes), transfer 
RNAs, adenosine triphosphate, guanosine triphosphate, a ribonucleoside 
triphosphate regenerating system (such as phosphoenol pyruvate and 
pyruvate kinase), and the salts and buffer required to synthesize a protein 
encoded by an exogenous mRNA. The translation of polypeptides can be 

30 made to occur under conditions which maintain intact polysomes, i.e. where 
ribosomes, mRNA molecule and translated polypeptides are associated in a 
single complex. This effectively leads to n ribosome display" of the translated 
polypeptide. 

For selection, the translated polypeptides, in association with the 
35 corresponding ribosome complex, are mixed with a target molecule which is 
bound to a matrix (e.g. Dynabeads). The target molecule may be any 
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compound of interest (or a portion thereof) such as a DNA molecule, a 
protein, a receptor, a cell surface molecule, a metabolite, an antibody, a 
hormone or a virus. The ribosomes displaying the translated polypeptides 
will bind the target molecule and these complexes can be selected and the 
5 mRNA re-amplified using RT-PCR. 

Although there are several alternative approaches to modify binding 
molecules the general approach for all displayed proteins conforms to a 
pattern in which individual binding reagents are selected from display 
libraries by affinity to their cognate receptor. The genes encoding these 
reagents are modified by any one or combination of a number of in vivo and 
in vitro mutation strategies and constructed as a new gene pool for display 
and selection of the highest affinity binding molecules. 

In order that the nature of the present invention may be more clearly 
understood preferred forms thereof will now be described with reference to 
the following examples. 

Example l 

Gene Construction and Cloning 

CTLA-4 STM (STM: soluble truncated mutants of CTLA-4, nsed herein 
to describe CTLA-4 chimaeric V-like domain proteins) gene consixucrion and 
cloning was by standard and well-described techniques (Polymerase chain 
reaction with specifically designed □ligonuckatkk primers, splice overlap 
extension, restriction enzyme digests etc). A list of oligonucleotide primers 
used is given in Figure 1. 

The wild-type STM construct was amplified from cloned human 
CTLA-4 DNA (Figure 2) (and could be similarly amplified from reverse 
transcribed human cDNA by a competent worker in the field) using the 
oligonucleotide primers #3553 and #4316, which incorporated Sfii and NotI 
restriction sites at the 5* and 3' ends respectively. These terminal primers 
were used in all further constructions except: (i) where #4851 or #5443 was 
used to incorporate an ApaLl site at the 5' end; (ii) where #4486 was used to 
add a C- terminal tail including residue Cysl20; (iii) where #5467 was used to 
incorporate an EcoRl site at the 5' end; and (iv) where the specific set of 
extension primers were used for ribosomal display. 

A splice overlap PCR strategy using combinations of the 
oligonucleotides primers listed in Figure 1 was used to produce variations of 
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CDR-1, CDR-2 and/or CDR-3 loop structure replacements. The variations, 
which are described in greater detail in the following examples are listed in 
Table 2. 



5 TABLE 2 



10 



CDfl-1 combinations 



CTLA-4 VLB 

AiilMysozyme 
Somatostatin- 14 
Anti-melanoma 



CDR-1 

S 1B FVC£VA.SPGKATE... 



VRV... 



S'-FVCEYA-SGYTIGPYCMG VRV.. 

S 1 B hVCEYA JVGCKNFFWKTFTSCATE. VRV„ 
S 19 FVCEYA.SGFTFSSYAMS VRV.. 



20 



Randomisation 1 


S 19 FVCEYA.XXXXXXXG 


VRV.. 


Randomisation 2 


S ,S FVCEYAJQCXXXXXXCXG. 


VRV... 


Randomisation 3 


S ta FVCEV> TTarXarXXaiCXG 


VRV— 


Randomisation 4 


S llk FVCEYA^3PGXSXX 


VRV... 


Randomisation 5 




VRV... 




S^CEYA.! 1 I X I X V XATtL. 


VRV... 


^Hi^iiQTnr^rntTn 7 


S^CEYAJLl X X X X CXATE. 


VRV_. 


Randomisation 6 


S^CEYAj^GC^NTXXXXXXTSCATE. 


VRV... 



25 

CDR-2 combinations 

CDR-2 

C11A-4 VLD Q M VTEVCAA.TYMMGNELTF.LDDSICT... 



30 AnU-lysozyme Q 4 *VTEVCAAjUNMGGGnT.LDDSICT... 

Haomagglutinin tag Q 44 VTEVCAA.TYPYDVPDYA.LDDSICT... 

Anti-melanoma Q 4 VtEVCAAj«SGSGGSTY.LDDS1CT... 

Randomisation 1 Q* 4 VTEVCAA.TYXXGXELTF.LDDSICT... 

35 Randomisation 2 Q 4 *VTEVCAA.CYXXGXELTFXDDSICT... 
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CDR-3 combinations 

CDR-3 

CTLA~4 VLD C^KV.ELMYPPPYYL GIG... 

5 Anti-lysozyme C° 3 KV.DSTIYASYTECGHGLSTGGYGYDS. GIG... 

Somatostatin-14 C"KV.EAGCKNFPWKTFTSC. GIG... 

Aiiti-melnuoma C° 3 KV.GWGLRGEEGDYYMDV GIG... 

Randomisation 1 C° 3 KV.XXXXXXXXXXX GIG... 

10 Randomisation 2 C"KV.XXJOCXXXXXXXXXXX GIG... 

Randomisation 3 C^KV.XXXXXX C.XXXX. GIG... 

Randomisation 4 C^KV.XXXXXXJLC.XXXX GIG... 

Randomisation 5 C 93 KV JCXXXXXXX. CJCXXXX GIG... 

Randomisation 6 ^KV.XXXXXXXXXX.CXXXXXXX..... GIG- 

15 

Randomisation 7 C^KVXXXXXXXXXX. QG._ 

Randomisation 8 C ej KV.EXXXXXX.CXXXXXXX- GIG™ 

Randomisation 9 C^KV.EAGCKNXXXXXXTSC OG— 

20 For generation of randomised sections of the rnp loop structures, 

similar splice-overlap techniques were used with oligcimcieotides where a 



given triple t(s) were encoded by the sequence NNg/T where N represents any 
of the four possible nucleotide bases. This combination covers ali possible 
amino acid residues. Alternatively, randomisation was biased towards 
25 certain subsets of amino acids (for example aromatic residues, Figure i t 
#5452). 

In some instances, a variant technique was used for STM gene 
construction, where randomised oligonucleotide primers were designed 
which incorporated restriction sites for direct cloning into the similarly 

30 modified (with complementary restriction sites) CTLA-4 VLD framework (for 
example Figure 1, #4254). 

Completed constructs were cut with approriate combinations of 
restriction enzymes (for example Sfil.Notl, ApaLl, EcoRl) and cloned into 
like sites in appropriate expression vectors. These vectors comprise: (i) for 

35 production of soluble protein expression vectors pGC (Coia et al, 1996) and 
pPOW (Power et al, 1992; Kortt et al. 1995) (it) for bacteriophage and 
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phagemid display, completed STM constructs were cut with the restriction 
enzymes Sfil and NotI or ApalJ and NotI and cloned into the vectors pHFA, 
and pFAB.Sc (phagemid) or pfd-Tet-DOG (phage). These vectors allow 
display of the STMs as gene3 protein fusions on the surface of bacteriophage 
5 in 1-2 (phagemid) or 3-5 (phage) copies per bacteriophage particle (Figure 3). 
AH DNA constructs were verified by restriction analysis and DNA 
sequencing and tested for expression of recombinant protein by standard and 
well-understood techniques (Polyacrylamide gel electrophoresis, Western 
blot etc). 

10 

Example 2 

Production and Isolation of Recombinant STM Proteins 

Recombinant proteins were produced using vectors which represent 
different protocols for periplasmic expression systems. These vectors were 

15 (i) pGC: this vector allows high level expression of heterologous proteins by 
chemical (IPTG) induction, which are targeted to the periplasmic space by 
means of a leader sequence. The leader sequence is subsequently cleaved to 
produce the mature protein. In addition, this vector contains two in- frame 8 
residue tag aeq i i^ti ps (FLAG tags) which allow affinity purification of the 

20 recombinant protein, (ii) pPOW, which, like pGC, allows high level heat 
inducible expression of proteins targeted to the periplasmic space by means 
of a cleavable leader sequence and two in-frame 8 residue tag sequences 
(FLAG tags). 

Recombinant proteins were purified by the following methods, which 
25 are but two variations of well established techniques, (i) Bacterial clones in 
vector pGC were grown overnight in 2YT medium/S^C /200 rpm/lOOmg/ml 
ampicillin, 1% glucose (final). Bacteria were diluted 1/100 into either 0.5 or 
21 of 2YT medium supplemented with lOOmg/ml ampicillin, 0.1% glucose 
(final), and grown at 28°C/ 200 rpm. These cultures were grown to an optical 
30 density of between 0.2-0.4. at which stage they were induced with ImM IPTG 
(final). Cultures were allowed to grow for 16 hours (overnight) before 
harvesting. Bacteria were collected by centrifugation (Beckman JA-14 rotor 
or equivalent/6K/10min/4°C) and the periplasmic fraction collected by 
standard techniques. Briefly, this involved resuspension of bacterial pellets 
35 in a 1/25 th volume of spheroplast forming buffer consisting of lOOmM Tris- 
HC1/0.5M sucrose/0.5 mMEDTA (pH8.0), followed by addition of l/500th 
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volume of hen egg lysozyme (2mg/nil in water) and incubation for lOmin. A 
0.5x solution of the above spheroplasting buffer was then added to a final 
volume of 1/5 th of the original culture, and the incubation continued for a 
further 20min. The cell debris was then pelleted by cenuufugation (Beckman 
5 JA-14 rotor or equivalent/9K/15min/4°C) and the supernatant containing the 
periplasmic fraction collected. All of the above procedures were performed 
at 4"C. Samples were processed immediately by sonication, filtration 
through a 0.45u nitrocellulose membrane and processed immediately or 
stored at 4°C in the presence of sodium azide (0.05%). If freezing was 

10 required, no more than one freeze-thaw cycle was allowed, (ii) Bacterial 
clones in pPOW were grown overnight at 30°C in 100 ml 2xYT broth 
containing 100 ug/ml (w/v) ampicillin. On the following day cultures were 
used to inoculate 900 ml fresh 2xYT broth containing 100 ug/ml (w/v) 
ampicillin, to OD8O0nm = 0.2-0.5, and grown at 30°C with shaking (150-200 

15 rpm) until OD600nm = 4 i.e. late log phase. To induce recombinant protein 
expression, the temperature was raised to 42?C for 1 hour and then dropped 
to 20 °C for a further hour. Cells were harvested by centrifugation (Beckman 
JA-14 /6K rpm/5 min/4°C), the cell pellet resuspended in 100 ml extraction 
buffer (20mM Tris pH 8.0V 0.2rrgrmi (tttt) rysozyme/O. 1% (v/v) Tween-20) 

20 and incubated at 4°C overnight Samples were sonicated for 30 seconds and 
cellular debris c ollected by centrifugaiicn (3eckman JA-14 /14K rpm/10 
min/4°C). The aqueous phase, containing the "lysozyme" wash, was retained. 
Cell pellets were then washed twice with ice-cold water and this "osmotic 
shock" wash was retained. Each wash consisted of resuspending the pellet in 

25 100 ml ice-cold water followed by incubation on ice for 10 minutes in the 
first instance followed by 1 hour in the second instance. Following 
centrifugation (Beckman JA-14 /14K rpm/10 min/4°C), the pH of the aqueous 
phase was adjusted by addition of 10 ml lOxTBS, pH 8. The "lysozyme" and 
"osmotic shock" washes were pooled and constitute the soluble or 

30 "periplasmic" protein fraction. Periplasmic fractions were sonicated, filtered 
through a 0.45 ji nitrocellulose membrane and processed immediately or 
stored at 4°C in the presence of sodium azide (0.05%), PMSF (23 ug/ral) and 
EDTA (50 raM). 

Recombinant proteins were purified by affinity chromatography 
35 through a divinyl sulphone activated agarose (Mini-Leak)-linked anti-FLAG 
antibody column. Periplasmic extracts were directly loaded onto a 10 ml 
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anti-FLAG column which had been pre-equilib rated in TBS (pH 8) containing 
0.05% (w/v) sodium azide. Bound proteins were eluted with Inimunopure 
Gentle Ag/Ab Elution Buffer (Pierce). Samples were then dialysed against 
TBS/0.05% (w/v) azide (pH 8), concentrated by ultrafiltration over a 3 kDa 
5 cut-off membrane (YM3, Diaflo), and analysed by HPLC on a pre-calibrated 
Superose 12 HR or Superdex 200 HR column (Pharmacia Biotech), at a flow 
rate of 0.5 ml/min. Fractions corresponding to monomelic , dimeric and 
tetrameric species were collected, concentrated as above, and stored at 4°C 
prior to analysis. Protein concentration was determined 

10 spectrophotometrically using an extinction coefficient at A280 of 1.27 for the 
CTLA-4R extracellular domain, 0.92 forCTLA-4-Soml, 1.13 for CTLA-4- 
Som3, 1.05 for CTLA-4-Anti-Lys. All of the above protein chemistry methods 
are standard techniques within the field. Purified proteins were analysed by 
standard techniques for example polyacrylamide gel electrophoresis, western 

15 blot, dot blot etc. 

Cloning and expression in the bacteriophage expression vectors pHFA, 
pFAB-5c and fd-tet dog, and subsequent production of recombinant 
bacteriophage, were by standard and well-established techniques. Screening 
of Ehraries of randomised CTLA-4 STMs was by standard and we 11- 

20 established techniques (Galanis et al 1997). 

Example 3 

CTLA-4 STMs incorporating Somatostatin and Haemagglutinin Peptides. 
Initially the CDRl or the CDR3 loop structures of the CTLA-4 STM 

25 weTe replaced with the somatostatin polypeptide. This 14 residue 

polypeptide is conformationally constrained by an intra-disulphide linkage 
between Cys3 and Cysl4 (Figure 4). This was reasoned to form a discrete 
protein loop, analogous to the CDR loops found in antibodies, and 
particularly analogous to the long CDRs found in camelid antibodies which 

30 are also stabilised by a disulphide linkage. The effect of substituting CDRl 
in the presence or absence of Cysl20 ie. whether a dimer could be produced, 
was also tested. These experiments produced an unexpected and surprising 
result. Substitution of either CDR-1 or -3 with somatostatin significantly 
enhanced the production of monomelic protein. This is illustrated in Figure 

35 5 where replacement of the CDR-3 loop structure with somatostatin 
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significantly increased the ratio of monomeric to dimeric/tetrameric protein 
species. 

In further experiments, simultaneous replacement of both CDR-1 and - 
3 loop structures by somatostatin resulted in production of high-levels of 
5 monomeric protein. This shows that extensive loop structure replacements 
can be accommodated by the CTLA-4 scaffolding. Structurally, one of the 
somatostatin loops may lie flat against the face of the molecule in a manner 
analogous to that of the CDR-3 loop structure of CTLA-4 VLD. 

In a further extension of the CDR loop structure-replacement strategy, 
a region corresponding to CDR- 2 was replaced with the 8-residue 
haemagglutinin (HA) tag sequence. Use of the conforniationally constrained 
somatostatin loop in this position was considered unsuitable as this region 
partially encompasses the C" beta strand running the length of the molecule. 
The HA tag could be detected upon this CTLA-4 STM by use of an anti-HA 
antibody. Gel filtration experiments showed the presence of a range of 
protein species, from monomeric through to aggregated species suggesting 
that CDR-2-only substitutions were not stable (Figures 6,7). 

Final proof of principle that the CTLA-4 CDR loop structures could be 
replaced with other polypeptides to produce monomeric, soluble, STMs was 
by simultaneous replacement of all three CDR loop structures with two 
somatostatin and one HA epitope respectively. This STM produced a 
correctly folded and monomeric protein upon gel filtration chromatography 
[Figures 6,7). 

The positions of CDR loop structure substitutions within the CTLA-4 
VLD for the various STMs are shown in figure 6. HFLC profiles of affinity- 
purified STM proteins are shown in figure 7. Identical results were obtained 
for proteins produced in two different protein expression systems: pGC 
where protein expression is chemically induced, and pFOW where protein 
expression is temperature induced (see Example 2) (Figure 8). 
Polyacrylamide gel electrophoresis followed by western blot analysis 
indicated that the CTLA-4 STMs could be Teduced and ran at the expected 
molecular weights and absent of glycosylation. Testing of isolated 
monomeric STM proteins showed that they remained monomeric after zero, 
one, or two freeze-thaw cycles (figure 9). 

CTLA-4 STMs retained the correct conformation since a 
conformationally-specific anti-CTLA-4 antibody recognised STMs with both 
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CDRl and -3 loop structure replacements. Interestingly, this antibody 
recognised the wild type monomer and the dimer (CDRl replaced) poorly, 
contrasting with the strong reaction observed for the modified protein 
species. This suggests that in the wild type STM some form of local 
5 interaction is occurring that occludes the antibody binding site, and that this 
interaction is similar to the result when two GTLA-4 molecules are tethered 
together (presumably blocking access to the antibody). 

Example 4 

CTLA-4 STMs Based Upon a Camel onli-Lysozyme Antibody. 

The camel V H H antibody cAb~Lys3 isolated from immunised camels 
specifically binds within the active site cleft of hen egg lysozyme (Desmyter 
et al. 1996). To illustrate the ability of CTLA-4 STMs to function in a similar 
fashion, the three CDR loop structures of CTLA-4 VLD STM were replaced 
with the three CDR loop regions from cAb-Lys3. Positions and sequence of 
the substitutions are shown in figure 6. Expression of this STM (2V8) in 
either pGC or pPOW based expression systems resulted in production of 
predominantly monomelic soluble protein (Figures 7, 8). Protein solubility 
of this CTLA-4 STM was superior to native CTLA-4 VLD. ELJSA analysis 
showed that (pGC produced) purified monomelic protein specifically bound 
hen egg lysozyme compared to non-specific antigens and compared to the 
CTLA-4 STM with somatostatin substituted within the CDRl loop structure 
(PP2) (Figure 10A). Real-time binding analysis by BlAcore showed that the 
lysozyme specifically bound to immobilised anti-lysozyme STM (Figure 10B). 
The CTLA-4 STM framework is thus folding correctly and presenting the 
CDR loop structures in a manner in which they can bind lysozyme antigen. 
To further enhance expression of the CTLA-4 VLD anti-lysozyme, the coding 
sequence was adjusted by splice overlap PCR to comprise codons preferential 
for E.coli expression. 

Example 5 

CTLA-4 STMs Based Upon a Human anti-Melanoma Antibody. 

The human-derived anti-melanoma antibody V86 specifically binds 
human melanoma cells. This antibody is unusual in that binding affinity 
resides entirely within the V H region, addition of a cognate V L decreases 
binding efficiency, and that the V H domain expressed with a small segment 
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of the V L domain has a high degree of solubility (Cai and Garen, 1997). To 
further illustrate that replacement of CTIA-4 VLD CDR loop structures 
enhances solubility and that the resultant STMs can be produced in bacterial 
expression systems, the three CDR loop structures of CTLA-4 were replaced 
5 with the three CDR loop regions from Vfi6. Positions and sequence of the 
substitutions are shown in Figure 6. Expression of this STM (3E4) in pGC 
again resulted in production of predominantly monomeric soluble protein 
(Figure 7) with enhanced solubility compared to the CTLA-4 VLD. 

10 Example 6 

Construction of CTLA-4 STMs as Libraries of Binding Molecules 

To select CTLA-4 STMs with novel binding specificities, VLD libraries 
were produced containing randomised CDRl and CDR3 loop structures. 
Oligonucleotide primers used for library construction are listed in figure 1. 
15 Combinations of oligonucleotide primers used for library construction are 
shown in Table 3. 

Table 3. CTLA-4 STM Library Combinations 
20 __ CDRl 



\ CDR3 


4453" 


4254 


5449 


5451 


5452 


54S0 


5446 


4835 


' 4402 


+ 1 


+ 1 


////////// 


////////// 


//«'/ 


mnnm 


////////// 


////////// 


4275 


+ 1 


+ 1 


tlltiliill 


////////// 


iifiinm 


lumm 


////////// 


////////// 


5470 


imumi 


IflltUlil 


+ 2 


+ 2 


+2 


+2 


+ 2 


////////// 


5474 


iiitiiiiii 


lililillli 


+2 


+ 2 


+ 2 


+ 2 


+ 2 


////////// 


5471 


////////// 


llltlHIli 


+ 2 


+ 2 


+ 2 


+ 2 


+2 • 


////////// 


5472 


////////// 


////////// 


+ 2 


+ 2 


+2 


+ 2 


+2 


////////// 


5475 


////////// 


////////// 


+ 2 


+2 


+ 2 


+ 2 


+2 


IIIIIIIIII 


5473 


////////// 


////////// 


+2 


+2 


+ 2 


+ 2 


+ 2 


Illlllllll 


4836 


////////// 


////////// 


////////// 


////////// 


////////// 


////////// 


////////// 


+ 3 



*: oligonucleotide number. 
+ : library combination. 
1,2,3: describes library number. 

25 
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DNA constructs encoding the resultant libraries were cloned into 
vectors pHFA or pFAB.Sc for production of fd-phagemid based libraries and 
into pfd-Tet-Dog for production of fd-phage based libraries (see examples 1 
and 2). Library 1 was cloned into vector pHFA and consisted of 2.1 x 10 7 
5 independent clones. Library 3 was cloned into vectors pHFA (5.7 x 10 s 
independent clones) and pfd-Tet-Dog [2.2 x Id* independent clones). 
Library 2 was cloned into pFAB.Sc [1.7 x 10 7 independent clones) and into 
pfd-Tet-Dog (1.6 x 10 5 independent clones). Numbers of independent clones 
were determined by counting gross numbers of transformed colonies 

10 constituting the library, followed by assaying for the presence and proportion 
of CTLA-4 STM-specific DNA. 

For library 2, Uib variability of the full library was tested by sequencing 
of representative clones. These results are presented in Table 4. The 
expected heterogeneity of insert size and sequence was observed. A high 

1 5 proportion of UAG termination codons were observed, consistent with the 
oligonucleotide randomisation strategy. To prevent these codons causing 
premature teraiination of the CTLA-4 STM gene3 protein fusions, libraries 
were transferred into the E.coh strains Tg-1 and JM109, which suppress this 
termination codon by insertion of a glutamic acid residue. Cysteine residues 

20 were present in the high numbers expected from the desgn of the 

oligonucleotides.and were in positions capahle of forming intra- and inter- 
CDR loop structure disulphide bonds. 
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Table 4 CDRl and CDR3 Inserts from a Representative Series of Library 2 
Clones 



CLONE 


CDRl 


CDR3 


3M-2 


ND1 


LPS S DTRAYS 


3M-3 


QESGGRPG 


LPRGPPLLSL 


3M-5 


SPGRCLN 


ND 


3M-6 


E WK R * HG G 


LCPGACGC VY 


3M-7 


NSG *NEGG 


ND 


3M-11 


DKPVTKSG 


ND 


3M-17 


SPG ACP* 


ND 


3M-18 


SPGKCDQ 


ND 


3M-19 


SPGMCAR 


LMYPPPYYL 


3M-Z0 


ND 


P FL FLPC*FFF 


3N-1 


WTLGHHKLCEG 


LFTCLLALCS 


3N-2 


SPGECYG 


S WLSTTXCLSSCS 


3N-3 


SPG*CQD 


LLGSLLSCFASLS 


3N-4 


SPG*CQD 


SPGSLLSCFASXS 


3N-5 


SPGRCTD 


VICHSS VCLSD/EVC 


3N-6 


ND 


DLPSYLACSI 


3N-7 


SPGRCDA 


ALCWDVFYCSFPSY 


3N-S 


ELFGHARYCKG 


VSITSP*LCPVKVFD 


3N-9 


SPGKVN 


LFVPFVSP 


3N-12 


SPGDLWV 


ESGLSPVSPCSLYSL 


3N-13 


TSANGPYG 


PWAYRFLAVL 


3N-14 


RKTREKYG 


ELMYPPPYYLGI 


3N-15 


SPGQELT 


ELFFLLYAPC YLFQR 



ND: Not Done 

*: UAG termination codon 
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Bacteriophage particles displaying CTLA-4 STMs as gene 3 protein 
fusions were rescued from E.coli cells by standard protocols and panned 
against antigens presented in a number of contexts as described in the 
5 following examples. 

Example 7 

CTLA-4 STM Li braries: Selection against Antigens on Solid Supp orts. 
Four different antigens falling into a class of proteins with clefts or 

10 crevices within their structures were selected for screening. It was 
anticipated that the CTLA-4 VLD STMs, being of smaller size than 
antibodies, and possessing elongated CDR loop structures (especially CDR-3) 
would be able to access these cleft regions. The antigens selected were: (i) 
hen egg lysozyme (EC 3.2.1.17); (ii) bovine carbonic anhydrase (EC 4.2.1.1 ); 

15 (iii) fungal a-amylase (EC 3.2. 1.1); and (iv) Streptoailoteichus Hindustanis 
resistance protein ShBle (Gatignol et al. 1986). For binding to plates, 
antigens in coating buffer (lmg/ml in 0.1M NaHC03 pH8.5) were bound to 
Costar ELISA plates by standard procedures. Rescued phage and phagemid- 
derived libraries were panned by standard and well-understood procedures 

20 except that lower than standard number of washes were employed to allow 
low affinity binding phage to be selected. Figure 11 shows titres of libraries 
selected against ShBle. After round 4. recovered bacteriophage titres were 
higher than controls. To those skilled in the art, this represents selection of 
specific binding moieties, and it is then a routine process to produce these 

25 selected CTLA-4 VLD STMs using expression vectors such as pGC or pPOW 
(as described in example 2). 

Example B 

CTLA-4 STM L ibraries: Selection against Antigens in Solution. 

30 For selection in solution, the antigens bovine carbonic anhydrase and 

fungal a-amylase were biotinylated and selections performed in solution 
using capture by streptavidin coated magnetic beads. Throughout these 
experiments washes were kept constant at either 2 or 5 washes per selection 
round. Titres of recovered bacteriophage post-elution are shown in Figure 

35 12. After round 4, recovered bacteriophage titres were higher than controls. 
To those skilled in the art, this represents selection of specific binding 
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moieties, and it is then a routine process to produce these selected CTLA-4 
VLD STMs using expression vectors such as pGC or pPOW (as described in 
example 2). 

5 Example 9 

CTLA-4 STM Li braries: Selection in an Alternative Display and Selection 
System . 

To allow further maturation and selection of antigen binding STMs, 
the CTLA-4 STM library was ligated into a plasmid to add a downstream C- 

10 terminal spacer polypeptide (heavy constant domain). Upstream 

transcriptional and translational initiation sequences were added by PCR 
amplification using specific oligonucleotides (Figure 1). This PCR DNA was 
used as a template for production of RNA followed by translation and display 
of the library on ribosomes in a coupled cell free translation system as 

15 described by He and Taussig (1997). To demonstrate binding, CTLA-4 STM 
ribosome complexes were panned on hepatitis B surface antigen (hbsa), 
glycophorin (glyA) and bovine serum albumin (BSA) coated dynabeads. RNA 
from ribosome complexes bound to hbsa, glyA and BSA was recovered by 
RT-PCR- It is then a routine process to clone these RT-PCR products into an 

20 expression vector such as pGC or pPO W (as described in example 2) allowing 
production of CTLA-4 STMs. It will be appreciated by persons skilled in the 
art that numerous variations and/or modifications may be made to the 
invention allowing display of libraries of CTLA-4 STMs as ribosome 
complexes (as in this example) as well as display on the surface of live cells 

25 (which may be derived from a eukaryotic or prokaryotic background) and 
may include bacteria, yeast, mammalian or insect cells. 

Example 10 

CTLA-4 STMs: Affinity Maturation and CDR2 Mutation. 

30 To allow further maturation and selection of antigen-binding STMs, 

and the construction of randomised CDR-1, -2 and -3 libraries, CDR-2 
randomised oligonucleotide primers were produced (Figure 1). A variation of 
these primers contained conserved cysteine residues to allow construction of 
STMs with CDR2-CDR3 disulphide linkages mimicing those found in llama 

35 single domain antibodies. Splice overlap PCR allowed creation of libraries 
containing all three CDR loop structures randomised. 



WO 99/45110 



30 



PCT/AU99/00136 



It will be appreciated by persons skilled in the art that numerous 
variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
5 be considered in all respects as illustrative and not restrictive. 
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SEQUENCE LISTING 
Applicant: Dtatech Pty Ltd 

5 

Title of Invention: V-like domain binding molecules 

Prior Application Number; PP2210 
10 Prior Application Filing Date: 1993-03-06 

Number of SEQ ID NOs : 138 

Software: Patentln Ver. 2.1 

15 

SEQ ID NO: 1 
Length: 6 
Type : PRT 

Organism: Homo sapiens 

20 

Sequence ; I 
Met Tyr Pro Pro Pro Tyr 
1 5 

25 

SEQ ID NO: 2 
Length: 54 
Type: DNA 

Organism: Artificial Sequence 

30 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

35 Sequence: 2 

ttattactcg cggcccagcc ggccatggcc gcaatgcacg tggcccagcc tgct 54 
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SEQ ID NO: 3 
Length: 60 
Type : DNA 
5 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: 



Sequence: 3 

ttattactcg cggcccagcc ggccatggcc gcaatgcacg tggcccagcc tgctgtggta 60 

SEQ ID NO: 4 
15 Length: 45 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

20 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR prix»r 

Sequence; 4 

tctcacagtg cacaggcaat gcacgtggcc cagcctgctg tggta 45 



Oligonucleotide PCR primer 



10 



25 



SEQ ID NO: 5 



Length: 39 



Type: DNA 



Organism: Artificial Sequence 



30 



Feature : 



Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



35 



Sequence: 5 



tctcacagtg cacaggcaat gcacgtggcc cagcctgct 



39 
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SEQ ID NO: 6 
Length: 43 
Type : DNA 
5 Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 6 

gcccagccgg ccgaattcgc aatgcacgtg gcccagcctg ctg 4 3 



SEQ ID NO: 7 
15 Length: SO 
Type: DNA 

Organism: Artificial Sequence 



Feature: 

20 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



Sequence: 7 

gcagctaata cgactcacta taggaacaga ccaccatgga cgtggcccag cctgctgcgg €0 

25 

SEQ ID NO: 8 
Length: 42 
Type: DNA 

Organism: Artificial Sequence 

30 

Feature : 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

35 Sequence : 8 

atctgcggcc gctacataaa tctgggtacc gttgccgatg cc 42 
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SEQ ID NO: 9 
Length: 66 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other information: Description o£ Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 9 

gctgaattct gatcagtgat ggtgatggtg atgtgcggcc gcgtcagaat ctgggcacgg 60 
ttctgg 66 



15 SEQ ID NO: 10 
Length: 51 
Type: DNA 

Organism: Artificial Sequence 

20 Feature: 

Other infooatirc: Z>escElpcisn. cf Art! facial Setrv 
Oligonucleotide FCR primer 



Sequence: 10 

25 gcccttgggc cgggagatgg tctgcttcag tggcgagggc aggtctgtgt g 51 

SEQ ID NO: 11 
Length i 49 
Type: DNA 
30 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

35 
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Sequence: 11 

cgagggcagg tctgtgtggg tcacggtgca cgtgaacctc tccccggag 49 

SEQ ID NO: 12 
5 Length: 51 
Type: DNA 

Organism; Artificial Sequence 
Feature: 

10 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 12 

cgtgaacctc tccccggagt tccagtcatc ctcgcagatg ctggcctcac c 51 

15 

SEQ ID NO: 13 
Length: 84 
Type: DNA 

Organism: Artificial Sequence 

20 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

25 Sequence: 13 

agctttgtgt gtgagtatgc agctggctgc aagaatttct tctggaagac tttcacatcc 60 
tgtgccactg aggtccgggt gaca 84 

SEQ ID NO; 14 
30 Length: B4 
Type : DNA 

Organism: Artificial Sequence 



35 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 
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Sequence: 14 

ctgggtaccg ttgccgatgc cacaggatgt gnaagtcttc cagaagaaat tcttgcagcc 60 
agcctccacc ttgcagatgt agag 84 

SEQ ID HO: 15 
Length: 75 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

15 Sequence: 15 

agctttgtgt gtgagtatgc agctggctgc aacaatnnfcn nknnknnknn knnkacatcc 60 
tgtgccactg aggtc 75 



10 



SEQ ID SC: 16 
20 Length: 75 
Type: 3P. 

Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



Sequence: 16 

ctgggtaccg ttgccgatgc cacaggatgt mnnnmnmnnm nnmnnmnnat tcttgcagcc 60 
30 agcctccacc ttgca 75 



SEQ ID NO: 17 
Length: 21 
Type: DNA 
35 Organism: Artificial Sequence 
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Feature : 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

5 Sequence: 17 

gtaggttgcc gcacagactt c 21 

SEQ ID NO: 18 
Length: 66 
10 Type: DMA 

Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: 
15 Oligonucleotide PCR primer 

Sequence: IB 

gaagtctgtg cggcaaccta cccgtatgac gttccggact acgccctiga tgartccatc 60 
tgcacg * €5 

20 

SEQ ID NO: 19 
Length: 78 
Type: DMA 

Organism: Artificial Sequence 

25 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



30 



Sequence: 19 

gccagctttg tgtgtgagta tgccagtggc tacaccatcg ggccgtactg catgggcgtc 60 
cgggtgacag tgcttcgg 73 
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SEQ ID MO: 20 
Length: 60 
Type: DNA 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

10 Sequence: 20 

tgtgcggcag ccatcaacat gggcggtggc atcaccttcc tagatgattc catctgcacg 60 



SEQ ID HO: 21 
15 Length: 60 
Type: DNA 

Organism: Art: f i rial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonndeoti.de FCR primer 



Sequence: 21 

atctaggaag gtgatgccac cgcccatgtt gatggctgcc gcacagactt cagtcacctg 60 

SEQ ID NO: 22 
Length: 69 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 
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Sequence : 22 

cagcccgtgg ccgcactcgt agtaggacgc gtagatcgtc gagtccacct tgcagatgta 60 
gagtcccgt 69 

5 SEQ ID NO: 23 
Length: 72 
Type : DNA 

Organism: Artificial Sequence 
10 Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence : 2 3 

15 aatctgggta ccgttgccga tgccggagtc atagccgtac cctcccgtgg acagcccgtg 60 
gccgcactcg ta 72 

SEQ ID NO: 24 
Length: 78 
20 Type: DNA 

Organise: ft -~ i f <~i »1 Sequence 

Feature: 

Other information: Description of Artificial Sequence: 
25 Oligonucleotide PCR printer 

Sequence: 24 

gccagctttg tgtgtgagta tgccagtgga ttcaccttca gttcctacgc catgtccgtc 60 
cgggtgacag tgcttcgg 7$ 

30 

SEQ ID NO: 25 
Length: 51 
Type: DNA 

Organism: Artificial Sequence 

35 
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Feature : 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

5 Sequence: 25 

gccatctccg gatccggagg ctcgacctac ctagatgatt ccatctgcac g 51 

SEQ ID NO: 26 
length: 54 
10 Type: DNA 

Organism: Artificial Sequence 

Feature : 

Other information: Description of Artificial Sequence: 
15 Oligonucleotide PCR primer 

Sequence; 26 

gtaggtcgag cctccggatc cggagatggc tgccgcacag acttcagtca cctg 54 

20 SEQ ID WO: 21 
Lencrh: 69 
Type : DNA 

Organism: Artificial Sequence 
25 Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 27 

30 cacgtccatg tagtagtctc cctcctcgcc gcgcagtccc cagcccacct tgcagatgta 60 
gagtcccgt 69 

SEQ ID NO: 26 
Length: 51 
35 Type: DNA 

Organism: Artificial Sequence 
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Feature: 



Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



5 



Sequence: 28 



aatctgggta ccgttgccga tgcccacgtc catgtagtag tctccctcct c 



51 



SEQ ID NO: 29 



10 Length: 66 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

15 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 29 

agcrtrgtgt: gxgagtargc a r- i r-TT? rz r-£ nnxr-.n rrnjcr ncinkgccac tgaggtccgg 60 
20 gtgaoa 66 

SEQ ID HO: 30 
Length: 68 
Type: DNA 
25 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: 



Sequence; 30 

cacgtggccc agcctgctgt ggtactggcc agcagccgag gcatcgccag ctttgtgtgt 60 



Oligonucleotide PCR primer 



30 



gagtatgc 



68 
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SEQ ID NO: 31 
Length: 66 
Type : DNA 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

10 Sequence: 31 

gtgtgtgagt aegegtnenn snnsnnsnns nnsnnstgen nsgctactga ggttcgtgtg 60 
acegtc 66 



SEQ ID NO: 32 
15 Length: 73 
Type: DNA 

Organism: Artificial Sequence 



Feature: 

20 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



Sequence: 32 

gecagctttg tgtgtgagta tgcannknnk nnknnknnkn nknnkggcgt ccgggtgaca 60 
25 gtgettegge agg 73 



SEQ ID NO: 33 
Length: 82 
Type: DNA 
30 Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR Primer 



35 
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Sequence: S3 

gecagctttg tgtgtgagta tgcannknnk nnknnknnkn nknnknnktg cnnkggcgtc 60 
cgggtgacag tgetteggea gg 

5 SEQ ID NO: 34 
Length: 82 
Type : DNA 

Organism: Artificial Sequence 
10 Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR Primer 

Sequence: 34 

15 gecagctttg tgtgtgagta tgcannknnk ywynnkywyn nknnkywytg cnnkggcgtc 60 
cgggtgacag tgetteggea gg 02 

SSQ ID NO: 35 
Leogtii: 70 
20 Type: DNA 

Crga r .i ra : Artificial Sequence 

Feature : 

Other ir.fornation: Description of Artificial Sequence: 
25 oligonucleotide PCR primer 

Sequence: 35 

gecagctttg tgtgtgagta tgcatctcca ggcnnknnkn nknnkgtccg ggtgacagtg 60 
etteggcagg 70 

30 

SEQ ID NO: 36 
Length: 70 
Type : DNA 

Organism: Artificial Sequence 

35 
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Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

5 Sequence: 36 

gccagctttg tgtgtgagta tgcatctcca ggcnnktgcn nknnkgtccg ggtgacagtg 60 
cttcggcagg 70 



SEQ ID NO: 37 
10 Length: 68 
Type: DNA 

Organism: Artificial Sequence 



Feature: 

15 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



Sequence : 37 

gtgactgaag tctgtgcggc aacctacnnk nnkgggnnkg agttgacctt cctagatgat 60 
20 tccatctg 68 



SEQ ID NO: 3B 
Length: 30 
Type: DNA 
25 Organism: Artificial Sequence 

Feature : 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

30 



Sequence: 38 

gtaggttgcc gcaeagactt cagtcacctg 



30 
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SBQ ID NO: 39 
Length: 68 
Type: DMA 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

10 Sequence: 39 

gtgactgaag tctgtgcggc atgctacnnk nnkgggnnkg agttgacctt cctagatgat 60 
tccatctg 6B 

S£Q ID NO: 40 
15 Length: 30 
Type: DKA 

Organisa: Artificial Sequence 
re=Tnr^: 

20 Otiier information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 40 

gtagcatgcc gcacagactt cagtcacctg 30 

25 

SEQ ID NO; 41 
Length: 69 
Type: DNA 
30 Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 
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Sequence: 41 

ctgggtaccg ttgccgatgc cmnnmnninnn mnnmnnmnnm nnmnnmnnct ccaccttgca 60 
gatgtagag 59 

5 SEQ ID NO: 42 
Length: 67 
Type : DNA 

Organism: Artificial Sequence 
10 Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



15 



30 



Sequence: 42 

aggtggaann annannanns nnsnnstgcn nsnnsnnsnn snnsnnsnns ggcatcggca 60 
acggtac 67 



SEQ ID NO: 43 
Length: 78 
20 Type: DNA 

Organism: Artificial Sequence 

Feature: 

other information: Description of Artificial Sequence: 
25 Oligonucleotide PCR primer 

Sequence: 43 

aatctgggta ccgttgccga tgccmnnmnn mnnmnnmnnm nnmnnmnnmn nmnncacctt 60 
gcagatgtag agtcccgt 78 



SEQ ID NO: 44 
Length: 93 
Type : DNA 

Organism: Artificial Sequence 



35 
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Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

5 Sequence: 44 

aatctgggta ccgttgccga tgcccagmnn mnnirjinmnnm nnmnnmnnmn nmnnmnnrann 60 
mnnmnnctcc accttgcaga tgtogagtcc cgt 93 

SEQ ID NO: 45 
10 Length: 81 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

15 Other information: Description of Artificial Sequence; 
Oligonucleotide PCR primer 

Sequence: 45 

a^rctcggta ccgttgccga tgccmnnmnn mnnmnngcara nnmnnmnnmn nmnnmnncac 60 
20 cttgcagatg tagagtcccg t $1 

SEQ ID NO: 4 6 
Length: 87 
Type: DNA 
25 Organism: Artificial Sequence 

Feature: 

Other information: Description 'of Artificial Sequence: 
Oligonucleotide PCR primer 

30 

Sequence: 4 6 

aatctgggta ccgttgccga tgccnmnmnn mnnmnnmnng camnnmnnmn nmnnmnnmnn €0 
mnncaccttg cagatgtaga gtcccgt 87 
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SEQ ID NO: 47 
Length: 99 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



10 Sequence : 4"? 

aatctgggta ccgttgccga tgccrannmnn rannmnnmnnia nngcamnnmn nmnnmnnmnn 60 
irmnmnnmnnm nnmnncacct tgcagatgta gagtcccgt 99 



SEQ ID NO: 4 8 
15 Length: 87 
Type: DNA 

Organism: Artificial Sequence 



Feature : 

20 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



Sequence : 48 

aatctgggta ccgttgccga tgccrwrmnn mnnmnnmnng camnnmnnmn nmnnmnnmnn 60 
25 mnncaccttg cagatgtaga gtcccgt 87 



SEQ ID NO: 49 
Length: 70 
Type : DNA 
30 Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

35 
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Sequence: 4 9 

atgcacgtgg cccagcctgc tgtggtgctg gccagcagcc gtggcatcgc cagctttgtg 60 
tgtgaatatg 70 

5 SEQ ID NO: 50 
Length: 77 
Type: DNA 

Organism: Artificial Sequence 
10 Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: SO 

15 gccagctttg tgtgtgaata tgcgtctggc tataccatcg gcccgtactg catgggtgtg 60 
cgtgtgaccg tgctgcg 77 

SEQ ID NO: 51 
Length: 54 
20 Type: DNA 

Organism: Artificial Sequence 

Feature : 

Other information: Description of Artificial Sequence: 
25 Oligonucleotide PCR primer 

Sequence: Si 

gtgcgtgtga ccgtgctgcg tcaggcggat agccaggtga ccgaagtttg cgcg 54 

30 SEQ ID NO: 52 
Length: 75 
Type: DNA 

Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



5 Sequence: 52 

caggtgaccg aagtttgcgc ggcagcgatc aacatgggcg gtggcatcac cttcctggat 60 
gattccatct gcacc 75 



SEQ ID NO: 53 
10 Length: 66 
Type: DNA 

Organism: Artificial Sequence 



Feature: 

15 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 53 

cagaccctgg atggtcaggt tcacctggtt accgctggag gcgccggtgc agatggaatc 60 
20 atccag 6e 



SEQ ID NO: 54 
Length: 57 
Type: DNA 
25 Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 

Sequence: 54 

cactttgcag atgtacagac cggtatccat ggcacgcaga ccctggatgg tcaggtt 57 
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SEQ ID NO: 55 
Length: 66 
Type: DNA 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: 
Oligonucleotide. PCR primer 

10 Sequence: 55 

caggccatga ccgcattcgt aataagacgc atagatggtg ctatccactt tgcagatgta 60 
cagacc 66 



SEQ ID NO: 56 
15 Length: 69 
'Type: DNA 

Organism: Artificial Sequence 
Feature: 

20 Other information: Description of Artificial Sequence: 
Oligonucleotide PCR primer 



Sequence: 56 

ctgggtaccg ttgccgatgc cagaatcgta gccatagcca ccggtggaca ggccatgacc 60 
25 gcattcgta gg 



SEQ ID NO: 57 
Length: 672 
Type: DNA 
30 Organism: Homo sapiens 



Sequence: 57 

atggcttgcc ttggatttca gcggcacaag 
ccctgcactc tcctgttttt tcttctcttc 
35 gcccagcctg ctgtggtact ggccagcagc 



gctcagctga acctggctgc caggacctgg 60 
atccctgtct tctgcaaagc aatgcacgtg 120 
cgaggcatcg ccagctttgt gtgtgagtat 180 
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gcatctccag gcaaagccac tgaggtccgg gtgacagtgc ttcggcaggc tgacagccag 240 

gtgactgaag tctgtgcggc aacctacatg acggggaatg agttgacctt cctagatgat 300 

tccatctgca cgggcacctc cagtggaaat caagtgaacc tcactatcca aggactgagg 3S0 

gccatggaca cgggactcta catctgcaag gtggagctca tgtacccacc gccatactac 420 

ctgggcatag gcaacggaac ccagatttat gtaattgatc cagaaccgtg cccagattct 480 

gacttcctcc tctggatcct tgcagcagtt agttcggggt tgttttttta tagctttctc 540 

ctcacagctg tttctttgag caaaatgcta aagaaaagaa gccctcttac aacaggggtc 600 

tatgtgaaaa tgcccccaac agagccagaa tgtgaaaagc aatttcagcc ttattttatt 660 

cccatcaatt ga 672 

SSQ ID NO: 58 
Length: 115 
Type : PRT 

Organism: Homo sapiens 
Sequence: 58 

Ala Met His Val Ala Gin Pro Ala Val Val Leu Ala Ser Ser Arg Gly 
15 10 15 

lie Ala Ser Phe Val Cys Glu Tyr Ala Ser Pro Gly Lys Ala Thr Glu 
20 25 30 

Val Arg Val Thr Val Leu Arg Gin Ala Asp Ser Gin Val Thr Glu Val 
35 40 45 

Cys Ala Ala Thr Tyr Met Met Gly Asn Glu Leu Thr Phe Leu Asp Asp 
50 55 60 



Ser He Cys Thr Gly Thr Ser Ser Gly Asn Gin Val Asn Leu Thr He 

30 65 70 75 . 80 

Gin Gly Leu Arg Ala Met Asp Thr Gly Leu Tyr He Cys Lys Val Glu 

85 90 95 



20 



25 



35 Leu Met Tyr Pro Pro Pro Tyr Tyr Leu Gly He Gly Asn Gly Ala Gin 
100 105 no 
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lie Tyr Val 
115 



SZQ ID NO: 59 
Length: 7 
Type : PRT 

Organism: Homo sapiens 
Sequence: 59 

Ser Pro Gly Lya Ala Thr Glu 
1 5 



SEQ ID NO; 60 
Length: 9 
Type: PRT 

Organism: Homo sapiens 
Sequence: 60 

Tyr Met Met Gly Asn Glu Leu Thr Phe 
1 5 



25 



SEQ ID NO; 61 
Length : 9 
Type: PRT 
30 Organism: Homo sapiens 



35 



Sequence: 61 

Leu Met Tyr Pro Pro Pro Tyr Tyr Leu 
1 5 
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SEQ ID NO: 62 
Length: 14 
Type : PRT 

Organism: Homo sapiens 
Sequence: 62 

Ala Gly Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 
1 5 10 



SEQ ID NO: 63 
Length: 9 
Type: PRT 

Organism: Artificial Sequence 
Feature: 

Other information: Description of Artificial Sequence: Haemag glut inin 
tag 



20 Sequence: 63 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 



25 SEQ ID NO: 64 
Length: 11 
Type : PRT 

Organism: Artificial Sequence 
30 Feature: 

Other information: Description of Artificial Sequence: Sequence from 
anti-lysozyme antibody 



35 



Sequence: 64 

Ser Gly Tyr Thr lie Gly Pro Tyr Cys Met Gly 
1 5 10 
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SEQ ID NO: 65 
Length: 10 
Type : PRT 
5 Organism: Artificial Sequence 

Feature : 

Other information: Description of Artificial Sequence: Sequence from 
anti-lysozyme antibody 



10 



20 



Sequence: 65 

Thr Tyr Met Met Gly Asn Glu Leu Thr Phe 
1 5 10 



SEQ ID NO: 66 
Length: 24 
Type: PRT 

Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: Sequence free: 
anti-lysozyme antibody 

25 Sequence: 66 

Asp Ser Thr lie Tyr Ala Ser Tyr Tyr Glu Cys Gly His Gly Leu Ser 
15 10 is 

Thr Gly Gly Tyr Gly Tyr Asp Ser 
30 20 



SEQ ID NO: 67 
Length: 11 
35 Type : PRT 

Organism: Homo sapiens 
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Sequence: 67 

Ser Gly phe Thr Phe Ser Ser Tyr Ala Met Ser 
1 5 10 

S 

SEQ ID NO: 58 
Length: 10 
Type : PRT 
10 Organism: Homo sapiens 

Sequence: 68 

Ala lie Ser Gly Ser Gly Gly Ser Thr Tyr 
1 5 10 

15 

SEQ ID NO: 69 
Length: 15 
Type : PRT 
20 Organism: Homo sapiens 

Sequence: 69 

Gly Trp Gly Leu Arg Gly Glu Glu Gly Asp Tyr Tyr Met Asp Val 
15 10 15 

25 

SEQ ID MO: 70 
Length: 21 
Type : PRT 
30 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: Flag-tag 
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Sequence: 70 

Ala Ala Ala Asp Tyr Lys Asp Asp Asp Asp Lys Ala Ala Asp Tyr Lys 
15 10 15 

5 Asp Asp Asp Asp Lys 
20 



SEQ ID NO: 71 
10 Length: 14 
Type : PRT 

Organism: Homo sapiens 
Sequence: 71 

15 Ser Phe Val Cys Glu Tyr Ala Ser Pro Gly Lys Ala Thr Glu 
1 5 io 



SEQ ID NO: 72 
20 Length: 18 
Type : PRT 

Organism: Artificial Sequence 
Feature: 

25 Other information: Description of Artificial Sequence: Fusion protein 
Sequence: 72 

Ser Phe Val Cys Glu Tyr Ala Ser Gly Tyr Thr He Gly Pro Tyr Cys 
15 10 15 

30 

Met Gly 
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SEQ ID NO: 73 
Length: 24 
Type: PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
Sequence: 73 

10 ser Phe Val Cys Glu Tyr Ala Ala Gly Cya Lya Asn Phe Phe Trp Lys 
15 10 15 

Thr Phe Thr Ser Cys Ala Thr Glu 
20 

15 

SEQ ID MO: 74 
Length: 18 
Type: PRT 
20 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Seque-ce: Fusion protein 

25 Sequence: 74 

Ser Phe Val Cys Glu Tyr Ala Ser Gly Phe Thr Phe Ser Ser Tyr Ala 
15 10 15 

Met Ser 

30 



SEQ ID NO: 7S 
Length: 15 
35 Type: PRT 

Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

5 

Sequence: 75 

Ser Phe Val Cys Glu Tyr Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly 

1 .... 5 10 15 

10 

SEQ ID NO: 76 
Length: 18 
Type: PRT 

Organism: Artificial Sequence 

15 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

20 Sequence: 76 

Ser Phe Val Cys Glu Tyr Ala Xaa Xaa iaa Xgg 
15 10 

Xaa Gly 

25 

SEQ ID NO: 77 
Length: 21 
30 Type: PRT 

Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence : Fusion protein 
35 containing random sequence 



SAa Xsa » a a Xaa Cy 
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Sequence : 77 

Ser Phe Val Cys Glu Tyr Ala Xaa Xaa Ala Arg Xaa Ala Arg Xaa Xaa 
15 10 15 



5 Ala Arg Cys Xaa Gly 
20 



SEQ ID NO: 78 
10 Length: 14 
Type : PRT 

Organism: Artificial Sequence 



Feature: 

15 Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 



Sequence: 78 

Ser Phe Val Cys Glu Tyr Ala Ser £=~z ZLt iaa Kaa Xaa Xaa 
20 1 5 IG 



SEQ ID NO: 75 
Length: 14 
25 Type: PRT 

Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
30 containing random sequence 

Sequence: 79 

Ser Phe Val Cys Glu Tyr Ala Ser Pro Gly Xaa Cys Xaa Xaa 
15 10 

35 
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SEQ ID NO: 80 
Length: 18 
Type : PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

10 Sequence: GO 

Ser Phe Val Cys Glu Tyr Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala 
15 10 15 

Thr Glu 



SEQ ID NO: 81 
20 Length: 18 
Type : PRT 

Organism: Artificial Sequence 
Feature: 

25 Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

Sequence: 81 

Ser Phe Val Cys Glu Tyr Ala Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Ala 
30 1 5 10 15 

Thr Glu 



35 
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SEQ ID NO: 82 
Length: 24 
Type : PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

10 Sequence: 82 

Ser Phe Val Cys Glu Tyr Ala Ala Gly Cys Lys Asn Xaa Xaa Xaa Xaa 
15 10 15 

Xaa Xaa Thr Ser Cys Ala Thr Glu 
15 20 



SEQ ID NO: 83 
Length: 25 
20 Type: PRT 

Organise: Eerie sapiens 

Sequence: 83 

Gin Val Thr Glu Val Cys Ala Ala Thr Tyr Het Met Gly Asn Glu Leu 
25 1 5 10 15 

Thr Phe Leu Asp Asp Ser lie Cys Thr 
20 25 

30 

SEQ ID NO: 84 
Length: 25 
Type : PRT 

Organism: Artificial Sequence 

35 
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Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
Sequence: 84 

5 Gin Val Thr Glu Val Cys Ala Ala Ala He Asn Met Gly Gly Gly lie 



Thr Phe Leu Asp Asp Ser He Cys Thr 
20 25 

10 

SEQ ID NO: 85 
Length: 25 
Type : PRT 
15 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 

20 Sequence: 85 

Gin Val Tfc_z Giu Val Cys Aia Ala Thr Tyr Pro Tyr Asp Val Pro Asp 
15 ic !5 

Tyr Ala Leu Asp Asp Ser He Cys Thr 



1 



5 



10 



15 



25 



20 



25 



SEQ ID NO: 86 



Length: 25 



30 



Type : PRT 



Organism: Artificial Sequence 



35 



Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
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Sequence: 86 

Gin Val Thr Glu Val Cys Ala Ala Ala lie Ser Gly Ser Gly Gly Ser 
15 10 15 

5 Thr Tyr Leu Asp Asp Ser lie Cys Thr 
20 25 



SEQ ID NO: 87 
10 Length: 25 
Type : PRT 

Organism: Artificial Sequence 
Feature: 

15 Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

Sequence : 37 

Sir Val Tzlz 31- Val Cys Ala Ala Thr Tyr Xaa Xaa Gly Xaa Glu Leu 
20 1 5 10 15 

Thr Phe Leu Asp Asp Sex He Cys Thr 
20 25 

25 

SEQ ID HO: 88 
Length: 25 
Type : PRT 

Organism: Artificial Sequence 

30 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 
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Sequence: 88 

Gin Val Thr Glu Val Cys Ala Ala Cys Tyr Xaa Xaa Gly Xaa Glu Leu 
IS 10 is 



5 Thr Phe Leu Asp Asp Ser lie Cys Thr 

20 25 



SEQ ID NO: 89 
10 Length: 13 
Type : PRT 

Organism: Homo sapiens 
Sequence: 89 

15 Cys Lys Val Glu Leu Met Tyr Pro Pro Pro Tyr Tyr Leu 
1 S 10 



SEQ ID HO: 90 
20 Length: 27 
Type: ?FET 

Organise: Artificial Sequence 



Feature: 

25 Other information: Description of Artificial Sequence: Fusion protein 
Sequence: 90 

Cys Lys Val Asp Ser Thr lie Tyr Ala Ser Tyr Tyr Glu Cys Gly His 
1 5 io is 



30 



Gly Leu Ser Thr Gly Gly Tyr Gly Tyr Asp Ser 
20 25 
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SEQ ID NO: 91 
Length: 18 
Type: PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
Sequence: 91 

10 Cys Lys Val Glu Ala Gly Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr 
15 10 15 

Ser Cys 

15 

SEQ ID NO: 92 
Length : 1 8 

20 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 

25 Sequence: 92 

Cys Lys Val Gly Trp Gly Leu Arg Gly Glu Glu Gly Asp Tyr Tyr Met 
1 5 . 10 is 

Asp Val 

30 

SEQ ID NO: 93 
Length: 14 
35 Type: PRT 

Organism: Artificial Sequence 



WO 99/451 10 



37 



PCT/AU99/00136 



Feature : 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

5 Sequence: 93 

Cys Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 



10 SEQ ID NO: 94 
Length: 18 
Type: PRT 

Organism: Artificial Sequence 
15 Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
ccaca : r ^.ag rar.dsn sequence 

Sequence: 

20 Cys Lys Val Xaa Xaa Xa=. Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 



25 

SEQ ID NO: 95 
Length: 14 
Type : PRT 
30 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

35 
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Sequence: 95 

Cys Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa 
1 5 jo 



SEQ ID NO: 96 
Length: 15 
Type : PRT 

Organism: Artificial Sequence 

10 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 



15 Sequence: 96 

Cys Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa 
1 5 io 15 



20 SEQ ID NO: 97 
Length: 17 
Type : PRT 

Organism: Artificial Sequence 
25 Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

Sequence: 97 

30 cys Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa 
1 5 io 15 

Xaa 



35 
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SEQ ID NO: 9B 
Length: 21 
Type : PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

10 Sequence: 98 

Cys Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Xaa 
15 20 



SEC ID SO: 99 

20 Type: PRT 

Crganisa: Artificial Sequence 

Feature: 

Cther information: Description of Artificial Sequence: Fusion protein 
25 containing random sequence 

Sequence: 99 

Cys Lys Val 61 u Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 

30 

SEQ ID NO: 100 
Length: IB 
Type : PRT 
35 Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

5 Sequence; 100 

Cya Lys Val Glu Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa 
15 10 15 

Xaa Xaa 



SEQ ID NO: 101 
15 Length: 16 
Type: PRT 

Organism: Artificial Sequence 
Feature: 

20 Other information: Description of Artificial Sequence: Fusion protein 
containing random sequence 

Sequence: 101 

Cys Lys Val Glu Ala Gly Cys Lys Asn Xaa Xaa Xaa Xaa Xaa Xaa Thr 
25 1 5 io 15 

Ser Cys 



30 



35 



SEQ ID NO: 102 
Length: 10 
Type : PRT 

Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

5 Sequence: 102 

Leu Pro Ser Ser Asp Thr Arg Ala Tyr Ser 
15 10 



10 SEQ ID NO: 103 
Length: 8 
Type: PRT 

Organism: Artificial Sequence 
15 Feature: 

Other information: Description of Artificial Sequence: CDRJ. and CDR3 
inserts possessing randomly generated sequence 

Sequence: 103 
20 Gin Glu Ser Gly Gly Arg Pro Gly 
1 5 



SEQ ID NO: 104 
25 Length: 10 
Type: PRT 

Organism: Artificial Sequence 



30 



Feature: 

Other information: Description of Artificial Sequence: CDRl and CDR3 
inserts possessing randomly generated sequence 



35 



Sequence: 104 

Leu Pro Arg Gly Pro Pro Leu Leu Ser Leu 
15 10 
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SEQ ID NO: 105 
Length: 7 
Type: PRT 
5 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

10 

Sequence: 105 
Ser Pro Gly Arg Cys Leu Asn 
1 5 

15 

SEQ ID NO: 106 
Length: 8 
Type: PRT 

Organism: Artificial Sequence 

20 

Feature: 

Name/Key : HODRES 
Location: (5) 

Other information: Stop codon but Glu when expressed in Tg-1 or JM109 
25 strains of E-coli 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

30 

Sequence: 106 

Glu Trp Lys Arg Glu His Gly Gly 
1 5 



35 
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SEQ ID NO: 107 
Length: 10 
Type: PRT 

Organism: Artificial Sequence 

5 

Feature : 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
-inserts possessing -randomly generated sequence 

10 Sequence: 107 

Leu Cys Pro Gly Ala Cys Gly Cys Val Tyr 
15 10 



15 SEQ ID NO: 108 
Length : 8 
Type : PRT 

Organism: Artificial Sequence 



20 Feature: 

Name/ Key : MQD_RES 
Location: (4) 

Other information: Stop codon but Glu when expressed in Tg-1 of JM109 
E.coli strains 

25 

Feature : 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



30 



Sequence: 108 

Asn Ser Gly Glu Asn Glu Gly Gly 
1 5 
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SEQ ID NO: 109 
Length: 8 
Type : PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

10 Sequence: 109 

Asp Lys Pro Val Thr Lys Ser Gly 
1 5 



15 SEQ ID NO: 110 
Length: 7 
Type: PRT 

Organism: Artificial Sequence 



20 Feature: 

Name/Key: MOD_RES 
Location: (7) 

Other information: Stop codon but Glu when expressed in Tg-1 or JM109 
strains of E. coll 

25 

Feature : 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



30 



Sequence : 110 
Ser Pro Gly Ala Cys Pro Glu 
1 5 
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SEQ ID NO: 111 
Length: 7 
Type : PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

10 Sequence: 111 

Ser Pro Gly Lys Cys Asp Gin 
1 5 



15 SEQ. ID HO: 112 
Length: 7 
Type: PRT 

Organism: Artificial Sequence 



20 Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



Sequence: 112 
25 Ser Pro Gly Met Cys Ala Arg 
1 5 



SEQ ID NO: 113 
30 Length: 11 
Type: PRT 

Organism: Artificial Sequence 



10 
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Feature: 

Maine/ Key: MOD_RES 
Location: (8) 

Other information: Stop codon but Glu when expressed in Tg-1 or JN109 
strains of E. coli 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 113 

Pro Phe Leu Phe Leu pro Cys Glu Phe Phe Phe 
1 & 10 



15 

SEQ ID NO: 114 
Length: 11 
Type : PRT 

Organism: Artificial Sequence 

20 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDPJ 
inserts possessing randomly generated sequence 

25 Sequence: 114 

Trp Thr Leu Gly His His Lys Leu Cya Glu Gly 
1 5 10 



30 SEQ ID NO: 115 
Length: 10 
Type : PRT 

Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

5 Sequence: 115 

Leu Phe Thr Cys Leu Leu Ala Leu Cys Ser 
15 10 



10 SEQ ID WO: 116 
Length: 7 
Type : PRT 

Organism: Artificial Sequence 
15 Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 116 
20 Ser Pro Gly Glu Cys Tyr Gly 

1 5 



SEQ ID NO: 117 
25 Length: 13 
Type: PRT 

Organism: Artificial Sequence 
Feature: 

30 Other information: Description of 
inserts possessing randomly 



Artificial Sequence: CDR1 and CDR3 
generated sequence 



35 



Sequence: 117 

Ser Trp Leu Ser Thr Thr Xaa Cys Leu Ser Ser Cys Set 
1 5 10 
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SEQ ID NO; 118 
Length: 7 
Type : PRT 
5 Organism: Artificial Sequence 



Feature: 

Name/Key: MOD_RES 
Location: (4) 

10 Other information: Stop codon but Glu when expressed in Tg-1 or JM109 
strains of E. coli 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
15 inserts possessing randomly generated sequence 



20 



Sequence: 118 
Ser Pro Gly Glu Cys Gin Asp 
1 5 



SEQ ID NO: 119 
Length: 13 
Type : PRT 
25 Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequencet 119 

Leu Leu Gly Ser Leu Leu Ser Cys Phe Ala Ser Leu Ser 
1 5 io 



35 
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SEQ ID NO: 120 
Length: 13 
Type: PRT 

Organism: Artificial Sequence 

5 

Feature: 

Other information: . Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

10 Sequence: 120 

Ser Pro Gly Ser Leu Leu Ser Cys Phe Ala Ser Xaa Ser 
1 5 10 



SEQ ID NO: 121 
Length: 7 
Type : PRT 

Organism: Artificial Sequence 

Feature- 
Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



25 Sequence: 121 

Ser Pro Gly Arg Cys Thr Asp 
1 5 



30 SEQ ID NO: 122 
Length: 13 
Type : PRT 

Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

5 Sequence: 122 

Val He Cys His Ser Ser Val Cys Leu Ser Asp Val Cys 
1 5 10 



10 SEQ ID NO: 123 
Length: 13 
Type: PRT 

Organism: Artificial Sequence 
15 Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 123 

20 Val He Cya His Ser Ser Val Cys Leu Ser Glu Val Cys 

1 5 10 



SEQ ID NO: 124 
25 Length: 10 
Type: PRT 

Organism: Artificial Sequence 



30 



Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



35 



Sequence: 124 

Asp Leu Pro Ser Tyr Leu Ala Cys Ser He 
1 5 10 
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SEQ ID NO: 125 
Length : 7 
Type : PRT 
5 Organism: Artificial Sequence 



10 



15 



Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 125 
Ser Pro Gly Arg Cys Asp Ala 
1 5 



SEQ ID NO: 126 
Length: 14 
Type: PRT 

Organism: Artificial Sequence 

20 

Feature: 

other information: Description of Artificial Sequence: CDRI a=c CXF5 
inserts possessing randomly generated sequence 

25 Sequence: 126 

Ala Leu Cys Trp Asp Val Phe Tyr Cys Ser Phe Pro Ser Tyr 
1 "5 10 



30 SEQ ID NO: 127 
Length: 11 
Type : PRT 

Organism: Artificial Sequence 
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Feature : 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

5 Sequence: 127 

Glu Leu Phe Gly His Ala Arg Tyr Cys Lys Gly 
1 5 10 



10 SEQ ID NO: 128 
Length: 15 
Type : PRT 

Organism: Artificial Sequence 



15 Feature: 

Name/ Key: M0D_RES 
Location: (7) 

Other information: Stop codon but Glu when expressed in Tg-1 of JM019 
strains of E. coli 

20 

Feature: 

Other information: Dea.-r^n^xae -z Artificial Seque==e: rD=u and CDR3 
inserts possessing randomly generated sequence 

25 Sequence: 128 

Val Ser lie Thr Ser Pro Glu Leu Cys Pro Val Lys Val Phe Asp 
15 10 15 



30 SEQ ID NO: 129 
Length : 7 
Type : PRT 

Organism: Artificial Sequence 
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Feature : 

Name/Key: MOD_RES 
Location: (6) 

Other information: Stop codon but Glu when expressed in Tg-1 or JM109 
5 strains of E. coli 



Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 129 
Ser Pro Gly Lys Val Glu Asn 
1 5 



15 

SEQ ID NO: 130 
Length: 8 
Type : PRT 

Organism: Artificial Sequence 

20 

Feature: 

Other information: Description of Artificial ^ — CDR1 and OR3 
inserts possessing randomly generated sequence 

25 Sequence : 130 

Leu Phe Val Pro Phe Val Ser Pro 
1 5 



30 SEQ ID NO: 131 
Length: 7 
Type : PRT 

Organism: Artificial Sequence 
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Feature : 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

5 Sequence : 131 

Ser Pro Gly Asp Leu Trp Val 
1 5 



10 SEQ ID NO: 132 
Length: 15 
Type: PRT 

Organism: Artificial Sequence 
15 Feature: 

Other information: Description cf Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly genera-ted sequence 

Sequence: 132 

20 Glu Ser Gly Leu Ser Pro Val Ser Pro Cys Ser Leu Tyr Ser Leu 

1 5 1C 15 



SEQ ID NO: 133 
25 Length: 8 
Type : PRT 

Organism: Artificial Sequence 



Feature: 

30 Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



35 



Sequence: 133 

Thr Ser Ala Asn Gly Pro Tyr Gly 
1 5 
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SEQ ID NO: 134 
Length: 10 
Type: PRT 
5 Organism: Artificial Sequence 

Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 



10 



15 



20 



Sequence : 134 

Pro Trp Ala Tyr Arg Phe Leu Ala Val Leu 
1 5 io 



SEQ ID NO: 135 
Length: 6 
Type : PRT 

Organism: Artificial Sequence 



Feature: 

Other information: Description of Artificial Sequ&nca: C3R1 ar=c d>R3 
inserts possessing randomly generated sequence 

25 Sequence: 135 

Arg Lys Thr Arg Glu Lys Tyr Gly 
1 5 



30 SEQ ID NO: 136 
Length: 12 
Type : PRT 

Organism: Artificial Sequence 
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Feature: 

Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 136 

Glu Leu Met Tyr Pro Pro Pro Tyr Tyr Leu Gly lie 
1 5 10 



10 SEQ ID NO: 137 
Length : 7 
Type: PRT 

Organism: Artificial Sequence 
15 Feature: 

Other information: Description cf Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 137 
20 Ser Pro Gly Gin Glu le- Tar 
1 5 



SEQ ID NO: 138 
25 Length: 15 
Type : PRT 

Organism: Artificial Sequence 



Feature: 

30 Other information: Description of Artificial Sequence: CDR1 and CDR3 
inserts possessing randomly generated sequence 

Sequence: 138 

Glu Leu Phe Phe Leu Leu Tyr Ala Pro Cys Tyr Leu Phe Gin Arg 
35 1 5 10 is 
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Claims: 

1. A binding moiety comprising at least one monomeric V-like domain 
(VLD) derived from a non-antibody ligand, the at least one monomeric V-like 

5 domain being characterised in that at least one CDR loop structure or part 
thereof is modified or replaced such that the solubility of the modified VLD 
is improved when compared with the unmodified VLD. 

2. A binding moiety according to claim 1 in which at least one CDR loop 
10 structure or part thereof is modified or replaced such that 

(i) the size of the CDR loop structure is increased when compared 
with the corresponding CDR loop structure in the unmodified VLD; and/or 

(ii) the modification or replacement results in the formation of a 
disulphide bond within or between one or more of the CDR loop structures. 

15 

3. A binding moiety comprising at least one moncmeiic V-like domain 
(VLD) derived from a non-antibody ligand, the at least one monomeric V-like 
domain being characterised in that at least one CDR loop structure or part 
thereof is modified or replaced such that 

20 (i) the size of the CDR loop structure is altered when compared with 

the corresponding CDR loop sli mlin e in the unrnceinesi VLD; and/or 

(ii) the mod ific a tio n or replacement resrdts in the formation of a 
disulphide bond within or between one or more of the CDR loop structures, 

25 4. A binding moiety according to claim 3 in which the size of the CDR 
loop structure is increased by at least two amino acid residues. 

5. A binding moiety according to claim 3 in which the size of the CDR 
loop structure is increased by at least six amino acid residues. 

30 

6. A binding moiety according to claim 3 in which the size of the CDR 
loop structure is increased by at least nine amino acid residues. 

7. A binding moiety according to any one of claims 1 to 6 in which the 
35 binding affinity of the modified VLD is altered when compared with the 

unmodified VLD. 
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8. A binding moiety according to claim 7 in which the affinity of the 
modified VLD to at least one natural ligaud of the unmodified VLD is 
reduced. 

5 

9. A binding moiety according to any one of claims 1 to 8 in which the 
binding specificity of the modified VLD is different to that of the unmodified 
VLD. 

10 10. A binding moiety according to any one of claims 1 to 9 in which the 
non-antibody ligand is a T-cell surface protein. 

11. A binding moiety according to claim 10 in which the non-antibody 
ligand is CTLA-4, CD28 or ICOS. 

15 

12. A binding moiety according to claim 11 in which the non-antibody 
ligand is CTLA-4. 

13. A binding moiety according to any one of claims 1 to 12 in which one 
20 or more of the CDR loop structures is replaced with a binding determinant 

derived from a non-antibody polypeptide. 

14. A binding moiety according to claim 13 in which the binding 
determinant is derived from somatostatin or haemagglutinin. 

25 

15. A binding moiety according to any one of claims 1 to 12 in which one 
or more of the CDR loop structures is replaced with one or more CDR loop 
structures derived from an antibody or antibodies. 

30 16. A binding moiety according to claim 15 in which the antibody or 
antibodies are derived from a rat, mouse, human, camel, llama or shark. 



35 



17. • A binding moiety according to claim 15 or claim 16 in which the 
antibody or antibodies are selected from the camel antibody cAB-Lys3 and 
the human anti-melanoma antibody V86. 
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18. A binding moiety according to any one of claims 1 to 17 linked to a 
diagnostic reagent 

19. A binding moiety according to claim 18 in which the diagnostic 
5 reagent is selected from the group consisting of streptavidin, biotin , a 

radioisotope, dye marker or other imaging reagent. 

20. A multivalent reagent comprising two or more binding moieties as 
claimed in any one of claims 1 to 19. 

10 

21. A binding moiety or multivalent reagent according to any one of claims 
1 to 20 immobilised on a solid support or coupled to a biosensor surface. 

22. A polynucleotide encoding a binding moiety or multivalent reagent as 
15 claimed in any one of claims 1 to 20. 

23. A vector comprising a polynucleotide according to claim 22. 

24. A host cell transformer with a as daimed in claim 23. 

20 

25. A best cell according to claim 24 in which the cell is a bacterial cell. 

26. A method of producing a binding moiety which comprises cul hiring a 
host cell as claimed in claim 24 or claim 25 under conditions enabling 

25 expression of the binding moiety and optionally recovering the binding 
moiety. 

27. A method according to claim 26 in which the binding moiety is 
unglycosylated. 

30 

28. A pharmaceutical composition comprising a binding moiety as claimed 
in any one of claims 1 to 20 and a pharmaceutical ly acceptable carrier or 
diluent 
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29. A method of treating a pathological condition in a subject, which 
method comprises administering to the subject a binding moiety as claimed 
in any one of claims 1 to 20. 

5 30. A method of selecting a binding moiety with an affinity for a target 
molecule which comprises screening a library of polynucleotides for 
expression of a binding moiety with an affinity for the target molecule, the 
polynucleotides encoding VLDs derived from one or more non-antibody 
ligands, wherein the polynucleotides have been subjected to mutagenesis 
10 which results in a modification or replacement in at least one CDR loop 
structure in at least one VLD and wherein the solubility of the isolated 
modified VLD is improved when compared with the isolated unmodified 
VLD. 

15 31. A method according to claim 30 in which the screening process 

involves displaying the modified V-like domains as gene HI protein fusions 
an the surface of bacteriophage particles. 

32. A method according to claim 30 in which the screening process 
20 involves displaying the modified V-like domains in a ribosomal display 

selection system. 

33. A binding moiety according to any one of claims 1 to 20 produced by a 
method according to any one of claims 30 to 32. 
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